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Abstract 

As software systems are getting increasingly connected, there is a need for equipping nonmonotonic 
logic programs with access to external sources that are possibly remote and may contain information 
in heterogeneous formats. To cater for this need, HEX programs were designed as a generalization 
of answer set programs with an API style interface that allows to access arbitrary external sources, 
providing great flexibility. Efficient evaluation of such programs however is challenging, and it requires 
to interleave external computation and model building; to decide when to switch between these tasks 
is difficult, and existing approaches have limited scalability in many real-world application scenarios. 
We present a new approach for the evaluation of logic programs with external source access, which is 
based on a configurable framework for dividing the non-ground program into possibly overlapping 
smaller parts called evaluation units. The latter will be processed by interleaving external evaluation 
and model building using an evaluation graph and a model graph, respectively, and by combining 
intermediate results. Experiments with our prototype implementation show a significant improvement 
compared to previous approaches. While designed for HEX-programs, the new evaluation approach 
may be deployed to related rule-based formalisms as well. 

KEYWORDS'. Answer Set Programming, Model Building, External Computation, HEX Programs 
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1 Introduction 


Motivated by a need for knowledge bases to access external sources, extensions of declara¬ 
tive KR formalisms have been conceived that provide this capability, which is often realized 
via an API-style interface. In particular, HEX programs ( Eiter et al, 2005| extend nonmono¬ 
tonic logic programs under the stable model semantics with the possibility to bidirectionally 
access external sources of knowledge and/or computation. E.g., a rule 
pointsTo(X,Y) &hasHyperlink[X](Y), url(X) 
might be used for obtaining pairs of URLs (X, Y), where X actually links Y on the Web, 
and SchasHyperlink is an external predicate construct. Besides constants (i.e., values) as 
above, also relational knowledge (predicate extensions) can flow from external sources to the 
logic program and vice versa, and recursion involving external predicates is allowed under 
safety conditions. This facilitates a variety of applications that require logic programs to 


interact with external environments, such as querying RDF sources using SPARQL (Polleres 


2007) 1, default rules on ontologies (Hoehndorf et al. 2007 Dao-Tran et al. 2009] >, complaint 
management in e-government (Zirtiloglu and Yolum 2008 ), material culture analysis (Mosca 


and Bernini 20081, user interface adaptation (Zakraoui and Zagler 201 2), multi-context 


reasoning (Brewka and Eiter 2007), or robotics and planning (Schiiller et al. 2013 Havur 
jet al. 2014] ), to mention a few. 

Despite the absence of function symbols, an unrestricted use of external atoms leads to 
undecidability, as new constants may be introduced from the sources; in iteration, this can 
lead to an infinite Herbrand universe for the program. However, even under suitable restric¬ 
tions like liberal domain-expansion safety (Eiter et al. 2014a) that avoid this problem, the 
efficient evaluation of HEX-programs is challenging, due to aspects such as nonmonotonic 
atoms and recursive access (e.g., in transitive closure computations). 

Advanced in this regard was the work by jEiter et al. (2012) , which fostered an evaluation 
approach using a traditional LP system. Roughly, the values of ground external atoms 
are guessed, model candidates are computed as answer sets of a rewritten program, and 
then those discarded which violate the guess. Compared to previous approaches such as 
the one by Eiter et al, (2006) , it further exploits conflict-driven techniques which were 


extended to external sources. A generalized notion of Splitting Set (Lifschitz and Turner 


1994[ ) was introduced by Eiter et al. (2006) for non-ground HEX-programs, which were 


then split into subprograms with and without external access, where the former are as large 
and the latter as small as possible. The subprograms are evaluated with various specific 
techniques, depending on their structure (Eiter et al. 2006; Schindlauer 2006). However, for 
real-world applications this approach has severe scalability limitations, as the number of 
ground external atoms may be large, and their combination causes a huge number of model 
candidates and memory outage without any answer set output. 

To remedy this problem, we reconsider model computation and make several contribu¬ 
tions, which are summarized as follows. 


• We present a modularity property of HEX-programs based on a novel generalization of 
the Global Splitting Theorem (Eiter et al. 2006j ), which lifted the Splitting Set Theorem (Of] 
|schitz and Turner 1994 1 to HEX-programs. In contrast to previous results, the new result is 
formulated on a rule splitting set comprising rules that may be non-ground, moreover it is 
based on rule dependencies rather than atom dependencies. This theorem allows for defining 
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answer sets of the overall program in terms of the answer sets of program components that 
may be non-ground. 

• Moreover, we present a generalized version of the new splitting theorem which allows 
for sharing constraints across the split; this helps to prune irrelevant partial models and 
candidates earlier than in previous approaches. As a consequence — and different from 
other decomposition approaches— subprograms for evaluation may overlap and also be 
non-maximal (resp. non-minimal). 

• Based on the generalized splitting theorem, we present an evaluation framework that 
allows for flexible evaluation of HEX-programs. It consists of an evaluation graph and a 
model graph ; the former captures a modular decomposition and partial evaluation order of 
the program, while the latter comprises for each node collections of sets of input models 
(which need to be combined) and output models to be passed on between components. This 
structure allows us to realize customized divide-and-conquer evaluation strategies. As the 
method works on non-ground programs, introducing new values by external calculations is 
feasible, as well as applying optimization based on domain splitting ( |Eiter et al. 2009 [ >. 

• A generic prototype of the evaluation framework has been implemented which can be 
instantiated with different solvers for Answer Set Programming (ASP) (in our suite, with dlv 
and clasp). It also features model streaming , i.e., enumeration of the models one by one. In 
combination with early model pruning, this can considerably reduce memory consumption 
and avoid termination without solution output in a larger number of settings. 

Applying it to ordinary programs (without external functions) allows us to do parallel 
solving with a solver software that does not have parallel computing capabilities itself 
(‘parallelize from outside’). 


2012) , is organized as follows. In Section[2]we present the HEX-language and consider an 
example to demonstrate it in an intuitive way; we will use it as a running example throughout 
the paper. In Section[3]we then introduce necessary restrictions and preliminary concepts 
that form dependency-based program evaluation. After that, we develop in Section[4]our 
generalized splitting theorem, which is applied in Section[5]to build a new decomposition 
framework. Details about the implementation and experimental results are given in Section[6] 
After a discussion including related work in Section [7j the paper concludes in Section [8] 
The proofs of all technical results are given in | Appendix A| 

2 Language Overview 

In this section, we introduce the syntax and semantics of HEX-programs as far as this is 
necessary to explain use cases and basic modeling in the language. 

2.1 HE X Syntax 

Let C, X, and Q be mutually disjoint sets whose elements are called constant names, 
variable names, and external predicate names, respectively. Unless explicitly specified, 
elements from X (resp.. C) are denoted with first letter in upper case (resp.. lower case), 
while elements from Q are prefixed with ‘ & ’. Note that constant names serve both as 
individual and predicate names. 


This paper, which significantly extends work in (Eiter et al. 2011) and parts of (Schuller 
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Elements from C U X are called terms. An atom is a tuple (Yq. Y\,... ,Y n ), where 
Yq, ... ,Y n are terms; n > 0 is the arity of the atom. Intuitively, Y.) is the predicate name, 
and we thus also use the more familiar notation Yo(Y \,..., Y n ). The atom is ordinary (resp. 
higher-order), if Vi is a constant (resp. a variable). An atom is ground, if all its terms 
are constants. Using an auxiliary predicate aux n for each arity n, we can easily eliminate 
higher-order atoms by rewriting them to ordinary atoms aux n (Yo ,..., Y n ). We therefore 
assume in the rest of this article that programs have no higher-order atoms. 

An external atom is of the form 

&g[Y 1 ,...,Y n \(X 1 ,...,X m ), (1) 

where Y\,..., Y n and Xi ,..., X m are two lists of terms (called input and output lists, 
respectively), and &g G Q is an external predicate name. We assume that &g has fixed 
lengths in(&g) = n and out(&g) = m for input and output lists, respectively. 

Intuitively, an external atom provides a way for deciding the truth value of an output tuple 
depending on the input tuple and a given interpretation. 

Example 1 

(a, b, c), a(b , c), node{X ), and D(a, b) are atoms; the first three are ordinary, where the 
second atom is a syntactic variant of the first, while the last atom is higher-order. 

The external atom &reach[edge, a] (X) may be devised for computing the nodes which 
are reachable in a graph represented by atoms of form edge(u, v) from node a. We have for 
the input arity in(&reach) = 2 and for the output arity out(&reach) = 1. Intuitively, given 
an interpretation I, &reach[edge, a](A) will be true for all ground substitutions X i—>■ b 
such that 6 is a node in the graph given by edge list {(zt, v) \ edge(u , v ) G /}, and there is a 
path from a to b in that graph. 

Definition 1 (rules and HEX programs) 

A rule r is of the form 

cti V • • • V a k <- pi,... ,f3 n ,not /3„+i,.. .,not /3 m , m,k> 0, (2) 

where all a, are atoms and all j3j are either atoms or external atoms. We let H(r ) = 
{«!,... ,«fc} and B{r) = B + (r ) U B~(r), where B + (r) = {/?i,... ,/3 n } and B~(r) = 
{Pn+i, ■ ■ ■ i Pm}- Furthermore, a (HEX) program is a finite set P of rules. 

We denote by const(P) the set of constant symbols occurring in a program P. 

A rule r is a constraint, if H(r) = 0 and B(r) 7 ^ 0; a. fact, if B(r) = 0 and H{r) 7 ^ 0; 
and nondisjunctive, if | //( r) < 1. We call r ordinary, if it contains only ordinary atoms. 
We call a program P ordinary (resp., nondisjunctive), if all its rules are ordinary (resp., 
nondisjunctive). Note that facts can be disjunctive, i.e., contain multiple head atoms. 

Example 2 (Swimming Example) 

Imagine Alice wants to go for a swim in Vienna. She knows two indoor pools called 
Margarethenbad and Amalienbad (represented by margB and amalB, respectively), and 
she knows that outdoor swimming is possible in the river Danube at two locations called 
Gansehaufel and Alte Donau (denoted gansD and altD, respective I y) £] S he looks up on 

1 To keep the example simple, we assume Alice knows no other possibilities to go swimming in Vienna. 
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{ location(ind , margB), location(ind, amalB). 
location(outd, gansD), location(outd, altD) 


rv 

swim (ind) V swim(outd) <- 


r 2 \ 

need (inoutd, C) -t 

- &rq[swim](C). 

V3'- 

goto(X) V ngoto(X) -f 

- swim(P), location(P, .A). 

r 4 : 

go <r 

- goto(X). 

rs- 

need(loc, C) -f 

- &rq[goto](C). 

c 6 : 


- goto(X), goto(Y), X ^ Y. 

cr. 


- not go. 

c 8 : 


- need(X, money). 


Fig. 1: Program If 


jIDB 


_ phjDri i i p 

r swim ^ r swim 


to decide swimming location 


the Web whether she needs to pay an entrance fee, and what additional equipment she will 
need. Finally she has the constraint that she does not want to pay for swimming. 

The HEX program P swim = Pj^im u ^IwiL shown in Figure [^represents Alice’s reason¬ 
ing problem. The extensional part P^IH contains a set of facts about possible swimming 
locations (where ind and outd are short for indoor and outdoor, respectively). The in- 
tensional part incorporates the web research of Alice in an external computation, 

i.e., using an external atom of the form &rq[location-choice\(required-resource), which 
intuitively evaluates to true iff a given location-choice requires a certain required-resource 
and represents such resources and their origin ( inoutd, or loc ) using predicate need. As¬ 
sume Alice finds out that indoor pools in general have an admission fee, and that one also 
has to pay at Gansehaufel, but not at Alte Donau. Furthermore Alice reads some reviews 
about swimming locations and finds out that she will need her Yoga mat for Alte Donau 
because the ground is so hard, and she will need goggles for Amalienbad because there is 
so much chlorine in the water. 

We next explain the intuition behind the rules in P SW im'- tt chooses indoor vs. outdoor 
swimming locations, and r 2 collects requirements that are caused by this choice. Rule r 3 
chooses one of the indoor vs. outdoor locations, depending on the choice in rq, and r 5 
collects requirements caused by this choice. By r,\ and we ensure that some location is 
chosen, and by eg that only a single location is chosen. Finally eg rules out all choices that 
require money. Note that there is no apparent requirement for the first argument of predicate 
need, however this argument ensures, that r 2 and r 5 have different heads, which becomes 
important in Example [73] 

The external predicate &rq has input and output arity in(&rq) = out(&rq) = 1. Intu¬ 
itively &rq[a](f3) is true if a resource /3 is required when swimming in a place in the 
extension of predicate a. For example, &rq[swim\(money) is true if suiim(ind) is true, 
because indoor swimming pool charge money for swimming. Note that this only gives an 
intuitive account of the semantics of &rq which will formally be defined in Example [4] 


2.2 HEX Semantics 


The semantics of HEX-programs (Eit er et al. 2006| Schindlauer 2006) generalizes the 
answer-set semantics ( Gelfond and Lifschitz 1991) . Let P be a HEX-program. Then the 
Herbrand base of P, denoted HBp, is the set of all possible ground versions of atoms and 
external atoms occurring in P obtained by replacing variables with constants from C. The 
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grounding of a rule r, grnd(r ), is defined accordingly, and the grounding of P is given by 
gmd(P) = U reP grnd(r). Unless specified otherwise, X and Q are implicitly given by P. 
Different from the ‘usual’ ASP setting, the set of constants C used for grounding a program 
is only partially given by the program itself; in HEX, external computations may introduce 
new constants that are relevant for semantics of the program. 

Example 3 ( ctd .) 

In Pswim the external atom &rq can introduce constants yogamat and goggles which are 
not contained in P SW im , but they are relevant for computing answer sets of P SW im- 

An interpretation relative to P is any subset I C HBp containing no external atoms. We 
say that I is a model of atom a £ HBp, denoted I \= a, if a £ I. 

With every external predicate name &g £ Q, we associate an (n+ro+l)-ary Boolean 
function (called oracle function) f& g assigning each tuple ( 7 , yi ..., y ni .. ., x m ) either 
0 or 1, where n = in(&g), m = out(&g), I C HBp , and Xi,yj £ C. We say that I C 
HBp is a model of a ground external atom a = Szg[yi, ..., y n ](x i,..., x m ), denoted I \= a, 
if f&g(I,yi ■ ■ - Vn, Xi, . . .,X m ) = 10 

Note that this definition of external atom semantics is very general; indeed an external 
atom may depend on every part of the interpretation. Therefore we will later (Section [XT] ) 
formally restrict external computations such that they depend only on the extension of those 
predicates in 7 which are given in the input list. All examples and encodings in this work 
obey this restriction. 

Example 4 (ctd.) 

The external predicate &rq in P SW im represents Alice’s knowledge about swimming loca¬ 
tions as follows: for any interpretation I and some predicate (i.e., constant) a, 

I |= &rq[a](money) iff f& rq (I, a, money) = 1 iff a(ind) £ I or a(gansD) £ 7, 

7 ^ &rq[a](yogamat) iff f& rq (I, a, yogamat) = 1 iff a(altD) £ 7, and 
7 ^ &rq[a](goggles) iff f& rq (I, a, goggles) = 1 iff a(amalB) £ I. 

Due to this definition of f&rq, it holds, e.g., that {swim(ind)} ^ &rq[swim](money). 
This matches the intuition about &rq indicated in the previous example. 

Let r be a ground rule. Then we say that 

(i) 7 satisfies the head of r, denoted 7 \= H(r), if 7 \= a for some a £ H(r ); 

(ii) 7 satisfies the body of r (I \= B(r)), if 7 \= a for all a £ B + (r) and 7 a for all 

a £ B~(r); and 

(iii) 7 satisfies r (7 \= r), if 7 \=H(r) whenever 7 |= B(r). 

We say that 7 is a model of a HEX-program P, denoted 7 |= P, if 7 \= r for all r £ grnd(P). 
We call P satisfiable , if it has some model. 

Definition 2 (answer set) 

Given a HEX-program P, the FLP-reduct of P with respect to 7 C HBp , denoted fP 1 , is 
the set of all r £ grnd(P) such that I \= B(r). Then 7 C HBp is an answer set of P if, 7 
is a minimal model of fP 1 . We denote by AS(P) the set of all answer sets of P. 

2 In the implementation. Boolean functions for defining external sources are realized as plugins to the reasoner 
which exploit a provided interface and can be written either in Python or C++. 
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Example 5 ( ctd .) 

The HEX program P SW im with external semantics as given in the previous example has a 
single answer set 

/ = {swim(outd), goto(altD), ngoto(gansD ), go, need(loc, yogamat)}. 

(Here, and in following examples, we omit from all interpretations and answer sets.) 

Under /, the external atom &rq[goto\{yogamat) is true and all others ( &rq[swim\{money ), 
&rq[goto\(money), &rq[swim\(yogamat), ...) are false. Intuitively, answer set I tells 
Alice to take her Yoga mat and go for a swim to Alte Donau. 


HEX programs I Eiter et al, 2005) > are a conservative extension of disjunctive (resp., normal) 
logic programs under the answer set semantics: answer sets of ordinary nondisjunctive 


HEX programs coincide with stable models of logic programs as proposed by Gelfond and 


[Lifschitz (1988) , and answer sets of ordinary HEX programs coincide with stable models of 
disjunctive logic programs ([Przyinusinski 1991 1 Gelfond and Lifschitz 19911. 


2.3 Using UEX-Programs for Knowledge Representation and Reasoning 


While ASP is well-suited for many problems in artificial intelligence and was successfully 
applied to a range of applications (cf. e.g. ( Brewka et al. 2011) ), modern trends computing, 
for instance in distributed systems and the World Wide Web, require accessing other sources 
of computation as well. HEX-programs cater for this need by its external atoms which 
provide a bidirectional interface between the logic program and other sources. 

One can roughly distinguish between two main usages of external sources, which we will 
call computation outsourcing , knowledge outsourcing, and combinations thereof. However, 
we emphasize that this distinction concerns the usage in an application but both are based 
on the same syntactic and semantic language constructs. For each of these groups we will 
describe some typical use cases which serve as usage patterns for external atoms when 
writing HEX-programs. 


2.3.1 Computation Outsourcing 

Computation outsourcing means to send the definition of a subproblem to an external 
source and retrieve its result. The input to the external source uses predicate extensions and 
constants to define the problem at hand and the output terms are used to retrieve the result, 
which can in simple cases also be a Boolean decision. 


On-demand Constraints A special case of the latter case are on-demand constraints of type 
•<— &forbidden[pi,... ,p„]() which eliminate certain extensions of predicates pi,... ,p n . 
Note that the external evaluation of such a constraint can also return reasons for conflicts 
to the reasoner in order to restrict the search space and avoid reconstruction of the same 
conflict I Eiter et al. 2012). This is similar to the CEGAR approach in model checking (Clarke 


et al. 2003) and can be helpful for reducing the size of the ground program: constraints do 


not need to be grounded but they are outsourced into an external atom of the above form, 
which then returns violated constraints as nogoods to the solver. This technique has been 
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used for efficient planning in robotics where external atoms verify the feasibility of a 3D 
motion ( |Schuller et al. 2013) . 


Computations which cannot (easily) be Expressed by Rules Outsourcing computations 
also allows for including algorithms which cannot easily or efficiently be expressed as a 
logic program, e.g., because they involve floating-point numbers. As a concrete example, 
an artificial intelligence agent for the skills and tactics game AngryBirds needs to perform 
physics simulations ( [Calimeri et al. 20T3 i. As this requires floating point computations 
which can practically not be done by rules as this would either come at the costs of very 
limited precision or a blow-up of the grounding, HEX-programs with access to an external 
source for physics simulations are used. 


Complexity Lifting External atoms can realize computations with a complexity higher than 
the complexity of ordinary ASP programs. The external atom serves than as an ‘oracle’ 
for deciding subprograms. While for the purpose of complexity analysis of the formalism 


it is often assumed that external atoms can be evaluated in polynomial time (Faber et al 
2004 as long as external sources are decidable there is no practical reason for limiting 


their complexity (but of course a computation with greater complexity than polynomial 
time lifts the complexity results of the overall formalism as well). In fact, external sources 
can be other ASP- or HEX-programs. This allows for encoding other formalisms of higher 
complexity in HEX-programs, e.g., abstract argumentation frameworks (Dung 1995). 


2.3.2 Knowledge Outsourcing 


In contrast, knowledge outsourcing refers to external sources which store information which 
needs to be imported, while reasoning itself is done in the logic program. 

A typical example can be found in Web resources which provide information for import, 


e.g., RDF triple stores (Lassila and Swick 1999[) or geographic data (Mosca and Bernini 


2008). More advanced use cases are multi-context systems, which are systems of knowledge¬ 
bases ( contexts ) that are abstracted to acceptable belief sets (roughly speaking, sets of atoms) 
and interlinked by bridge rules that range across knowledge bases ( Brewka and Eiter 2007) >; 
access to individual contexts has been provided through external atoms ( jBogl et al. 2010 ). 
Also sensor data, as often used when planning and executing actions in an environment, is a 


form of knowledge outsourcing (cf. Acthex (Basol et al. 20101). 


2.3.3 Combinations 


It is also possible to combine the outsourcing of computations and of knowledge. A typical 
example are logic programs with access to description logic knowledge bases (DL KBs), 
called DL-programs (Eiter et al. 2008). A DL KB does not only store information, but also 
provides a reasoning mechanism. This allows the logic program for formalizing queries 
which initiate external computations based on external knowledge and importing the results. 


1 Under this assumption, deciding the existence of an answer set of a propositional HEX-program is Y,f -complete. 
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3 Extensional Semantics and Atom Dependencies 

We now introduce additional important notions related to HEX-programs. Some of the 
following concepts are needed to make the formalism decidable, others prepare the basic 
evaluation techniques presented in later sections. 


3.1 Restriction to Extensional Semantics for HEX External Atoms 


To make HEX programs computable in practice, it is useful to restrict external atoms, such 


that their semantics depends only on extensions of predicates given in the input tuple (Eiter 
et al. 2006|. This restriction is relevant for all subsequent considerations. 


Syntax Each &g is associated with an input type signature t\,... ,t n such that every t, 
is the type of input Y t at position i in the input list of &g. A type is either const or a 
non-negative integer. 

Consider &g, its type signature ti ,... ,t n , and a ground external atom &g[yi, ..., 
y n \(x i, ... ,x m ). Then, in this setting, the signature of &g enforces certain constraints on 
f& g {I , yi,..., y n , Xi,..., x m ) such that its truth value depends only on 

(a) the constant value of yt whenever ti = const, and 

(b) the extension of predicate yt. of arity ti, in I whenever ti £ N. 

Note that parameters of type const are different from parameters of type 0. In the former 
case, a parameter is interpreted as a constant that is passed to the external source (essentially 
as string “p"), while a parameter p with a non-negative integer as type is interpreted as 
predicate whose extension is passed; in the special case of type 0, the extension reduces to 
the truth value of the propositional atom p. 

Example 6 ( ctd .) 

Continuing Example[l] for &reach[edge , a] (a;), we have t\ = 2 and t 2 = const. Therefore 
the truth value of &reach[edge , a](x) depends on the extension of binary predicate edge, 
on the constant a, and on x. 

Continuing Example |4j the external predicate &rq has t\ = 1, therefore the truth value of 
&rq\swim\(x) for various x wrt. an interpretation I depends on the extension of the unary 
predicate swim in the input list. 

Note that the truth value of an external atom with only constant input terms, i.e., f t = 
const, 1 < i < n, is independent of I. 

Semantic constraints enforced by signatures are formalized next. 


Semantics Let a be a type, I be an interpretation and p £ C. The projection function 
II a (I,p) is the binary function such that n cons t(/,p) = p for a = const, and Il a (I,p) = 
{{p,x i,..., x a ) | p (xi ,..., x a ) £ 1} for a £ N. Recall that atoms p{x i,..., x a ) are 
tuples (p,x i,..., x a ). The codomain D a of II a (/,p) is D a := C a+1 for a £ N, i.e., the 
a+l-fold cartesian product of C, which contains all syntactically possible atoms with a 
arguments; furthermore we let D const := C. 
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Definition 3 (extensional evaluation function) 

Let &g be an external predicate with oracle function f& g , in(&g ) = n, out(&g) = to, and 
type signature t\,..., t n . Then the extensional evaluation function F& g : D tl x • • • x D tri —t 
2 Cm of &g is defined such that for every a = (ai,..., a m ) 

^ £ F&g (n tl (I ,p\), . . . , H tn (I, p n )) iff f &g (d, Pi , • • • ; Pn , til ) * • • ? &m) — 1. 

Note that F& g makes the possibility of new constants in external atoms more explicit: tuples 
returned by F& g may contain constants that are not contained in P. Furthermore, F& g is 
well-defined only under the assertion at the beginning of this section. 

Example 7 ( ctd .) 

For I from Example[5] we have swim ) = {(swim, outd)} and 

ni(/, goto) = {(goto, altD)}. The extensional evaluation function of &rq is 

F& rq (U) = {(money) \ (X, ind) £ U or (X, gansD) £ U} U 

{( yogamat) \ (X, altD) £ U} U {(goggles) \ (X , amalB) £ U} 
Observe that none of the constants yogamat and goggles occurs in P (we have that 
const(P) = {swim, goto, ngoto, need, go, inoutd, loc, ind, outd, amalB, gansD, altD, 
margB, money, location}). These constants are introduced by the external atom semantics. 
Note that (money) is a unary tuple, as &rq has a unary output list. 


3.2 Atom Dependencies 


To account for dependencies between heads and bodies of rules is a common approach for 
realizing semantics of ordinary logic programs, as done, e.g., by means of the notions of 


stratification and its refinements like local stratification (Przymusinski 1988 1 or modular 
stratification (Ross 1994 1 , or by splitting sets ( Lifschitz and Turner 1994) . In HEX pro¬ 
grams, head-body dependencies are not the only possible source of predicate interaction. 


Therefore new types of (non-ground) dependencies were considered by Eiter et al. (2006) 
and Schindlauer (2006| l. In the following we recall these definitions but slightly reformulate 
and extend them, to prepare for the following sections where we lift atom dependencies to 
rule dependencies. 

In contrast to the traditional notion of dependency, which in essence hinges on proposi¬ 
tional programs, we must consider non-ground atoms; such atoms a and b clearly depend 
on each other if they unify, which we denote by a ~ b. 

For analyzing program properties it is relevant whether a dependency is positive or 
negative. Whether the value of an external atom a depends on the presence of an atom b in 
an interpretation I depends in turn on the oracle function f& g that is associated with the 
external predicate &g of a. Depending on other atoms in I, in some cases the presence of 
b might make a true, in some cases its absence. Therefore we will not speak of positive 


and negative dependencies, as by Eiter et al. (2011 1 , but more adequately of monotonic and 
nonmonotonic dependencies, respectively]^] 


4 Note that anti-monotonicity (i.e., a larger input of an external atom can only make the external atom false, but 
never true) could be a third useful distinction that was exploited in (Eiter et al. 201 2\ . We here only distinguish 
monotonic from nonmonotonic external atoms and classify antimonotonic external atoms as nonmonotonic. 
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Definition 4 

An external predicate &g is monotonic, if for all interpretations /, /' such that / Cl' 
and all tuples X of constants, f& g (I, X) = 1 implies X) = 1; otherwise &g is 

nonmonotonic. Furthermore, a ground external atom a is monotonic, if for all interpretations 
I , /' such that I C I' we have / |= a implies I' \= a; a non-ground external atom is 
monotonic, if each of its ground instances is monotonic. 

Clearly, each external atom that involves a monotonic external predicates is monotonic, 
but not vice versa; thus monotonicity of external atoms is more fine-grained. In the following 
formal definitions, for simplicity we only consider external predicate monotonicity and 
disregard external atom monotonicity. However the extension to arbitrary monotonic external 
atoms is straightforward. 

Example 8 (ctd.) 

Consider F& rq (U) in Example]?] adding tuples to U cannot remove tuples from F& rq (U), 
therefore &rq is a monotonic external predicate. 

Next we define relations for dependencies from external atoms to other atoms. 

Definition 5 (External Atom Dependencies) 

Let P be a HEX program, let a = &g[X ..., X/ :: ](Y ) in P be an external atom with the 
type signature t±,... ,tk and let b = p{ Z) be an atom in the head of a rule in P. Then a 
depends external monotonically (resp., nonmonotonically) on b, denoted a b (resp., 
a -+ e n b), if &g is monotonic (resp., nonmonotonic), and for some % £ {1 ..... /e} we have 
that Z has arity fj £ Nand Xi = p. We define that a — > e b if a —> e m b or a —> e n b. 


Example 9 (ctd.) 

In our example we have the three external dependencies &rq[swim\(C ) — swim(ind), 
&rq[swim\(C) —swim(outd), and &rq[goto\(C) goto(X). 

As in ordinary ASP, atoms in HEX programs may depend on each other because of rules 
in the program. 


Definition 6 

For a HEX-program P and atoms a, 3 occurring in P, we say that 


(a) a depends monotonically on j3 (a — 5), if one of the following holds: 

(i) some rule r £ P has a £ H(r) and (3 £ B + (r)\ 

(ii) there are rules ri, r 2 £ P such that a £ B(ri), (3 £ H(rf), and a ~ f3; or 

(iii) some rule r £ P has a £ H(r) and (3 £ Ft(r). 

(b) a depends nonmonotonically on 3 (a —> n 3), if there is some rule r £ P such that 

a £ H{r) and 3 € B~(r). 


Note that combinations of Definitions [5] and [ 6 ] were already introduced by Schindlauer 
(20061 and Eiter et al. (2009[ >; however these papers represent nonmonotonicity of external 


atoms within rule body dependencies and use a single ‘external dependency’ relation that 
does not contain information about monotonicity. In contrast, we represent nonmonotonicity 
of external atoms where it really happens, namely in dependencies from external atoms to 










12 


Eiter, Fink, Ianni, Krennwallner, Redl, and Schuller 


m 



swim(ind) swim(outd) 



&rq [swim] (C) swim(P) 



Fig. 2: Atom dependency graph of running example P SW i m - 

ordinary atoms. We therefore obtain a simpler dependency relation between rule bodies and 
heads. 

We say that atom a depends on atom /3, denoted a — > /3, if either a —/?, a —> n /3, or 
a —> e /?; that is, —»• is the union of the relations —» m , —> n , and — t e . 

We next define the atom dependency graph. 

Definition 7 

For a HEX-program P, the atom dependency graph ADG(P) = ( Va,Ea ) of P has as 
vertices Va the (possibly non-ground) atoms occurring in non-facts of P and as edges Ea 
the dependency relations —> m , ~>rn and —between them in P. 

Example 10 (ctd.) 

Figure[2]shows ADG(P SW i m ). Recall that cj is ‘ not go’. Note that the nonmonotonic 
body literal in C 7 does not show up as a nonmonotonic dependency, as C 7 has no head atoms. 
(The rule dependency graph in Section[4]will make this negation apparent.) 

Next we use the dependency notions to define safety conditions on HEX programs. 

3.3 Safety Restrictions 

To make reasoning tasks on HEX programs decidable (or more efficiently computable), the 
following potential restrictions were formulated. 

Rule safety This is a restriction well-known in logic programming, and it is required to 
ensure finite grounding of a non-ground program. A rule is safe, if all its variables are safe, 
and a variable is safe if it is contained in a positive body literal. Formally a rule r is safe iff 
variables in H(r) U B~(r) are a subset of variables in B + (r). 
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Domain-expansion safety In an ordinary logic program P, we usually assume that the set 
of constants C is implicitly given by P. In a HEX program, external atoms may invent new 
constant values in their output tuples. We therefore must relax this to ‘C is countable and 
partially given by P’, as shown by the following example. 


Example 11 

In the Swimming Example, grounding P SW im with const (P SW i m ) is not sufficient. Further 
constants ‘generated’ by external atoms must be considered. For example yogamat <f 
const(P SW im) and I \= &rq[goto](yogamat), hence we must ground 

need(loc,C) <— &rq[goto\(C) 
with C = yogamat to obtain the correct answer set. 


Therefore grounding P with const(P) can lead to incorrect results. Hence we want to 
obtain new constants during evaluation of external atoms, and we must use these constants 
to evaluate the remainder of a given HEX program. However, to ensure decidability, this 
process of obtaining new constants must always terminate. 

Hence, we require programs to be domain-expansion safe ( Eiter et al, 20Q6) >: there must 
not be a cyclic dependency between rules and external atoms such that an input predicate of 
an external atom depends on a variable output of that same external atom, if the variable is 
not guarded by a domain predicate. 

With HEX we need the usual notion of rule safety, i.e., a syntactic restriction which 
ensures that each variable in a rule only has a finite set of relevant constants for grounding. 

We first recall the definition of safe variables and safe rules for HEX. 


Definition 8 (Def. 5 by Eiter et al. (2006)) 

The safe variables of a rule r is the smallest set of variables X that occur either (i) in some 
ordinary atom (3 £ B + (r), or (ii) in the output list X of an external atom &g [Yi,..., Y n ] (X) 
in B + (r) where all Y-\,, Y n are safe. A rule r is safe , if each variable in r is safej^j 


However, safety alone does not guarantee finite grounding of HEX programs, because an 
external atom might create new constants, i.e., constants not part of the program itself, in its 
output list (see Example|7}. These constants can become part of the extension of an atom in 
the rule head, and by grounding and evaluation of other rules become part of the extension 
of a predicate which is an input to the very same external atom. 


Example 12 (adapted from Schindlauer (2006^ ) 

The following HEX program is safe according to Definition [8] and nevertheless cannot be 
finitely grounded: 


source (“http : //some_url’’) £- . 

url(X) <r- &rdf[source](X , “ rdf: sub Class Of", C ). 
source(X) ■£- url(X). 

Suppose the &rdf[source\(S , P, O) atom retrieves all triples (S, P, O) from all RDF triple¬ 
stores specified in the extension of source, and suppose that each triplestore contains a 


5 This is stated by Eiter et al. (20061 as ‘if each variable appearing in a negated atom and in any input list is safe, 
and variables appearing in H (r) are safe’, which is equivalent. 
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triple with a URL S that does not show up in another triplestore. As a result, all these URLs 
are collected in the extension of source which leads to even more URLs being retrieved and 
a potentially infinite grounding. 

However, we could change the rule with the external atom to 

url(X) £- &rdf[source](X, “rdf:subClassOf”,C), limit(X) (3) 

and add an appropriate set of limit facts. This addition of a range predicate limit(X ) which 
does not depend on the external atom output ensures a finite grounding. 


To obtain a syntactic restriction that ensures finite grounding for HEX, so called strong 
safety has been introduced for the HEX programs ( Eiter et al, 2006 1. Intuitively, this concept 
requires all output variables of cyclic external atoms (using the dependency notion from 
Definition[7]i to be bounded by ordinary body atoms of the same rule which are not part of 
the cycle. However, this condition is unnecessarily restrictive, and therefore, the extensible 
notion of liberal domain-expansion safety (Ide-safety) was introduced by |Eiter et al. (2014a| , 
which we will use in the following. For the purpose of this article, we may omit the formal 
details of lde-safety (see (Eiter et al. (2014a| and |Appendix D| for an outline); it is sufficient 
to know that every lde-safe program has a finite grounding that has the same answer sets as 
the original program. 


4 Rule Dependencies and Generalized Rule Splitting Theorem 

In this section, we first introduce a new notion of dependencies in HEX-programs, namely 
between non-ground rules in a program (Section pTT) . Based on this notion, we then present a 
modularity property of HEX-programs that allows us to obtain answer sets of a program from 
the answer sets of its components (Section [4.2[ ). The property is formulated as a splitting 
theorem based on dependencies among rules and lifts a similar result for dependencies 
among atoms, viz. the Global Splitting Theorem ( [Eiter et al. 2006| , to this setting, and 
it generalizes and improves it. This result is exploited in a more efficient HEX-program 
evaluation algorithm, which we show in Section [5] 


4.1 Rule Dependencies 
We define rule dependencies as follows. 

Definition 9 (Rule dependencies) 

Let P be a program and a, b atoms occurring in distinct rules r,s £ P. Then r depends on 
s according to the following cases: 

(i) if a ~ b, a £ B + (r), and b £ H(s), then r —y m s; 

(ii) if a ~ b, a £ B~ (r), and b £ H(s), then r —s; 

(iii) if a ~ b, a £ H{r), and b £ H(s), then both r —s and s -* m r; 

(iv) if a — b, a £ B(r) is an external atom, and b £ H(s), then 

• r -£ m s if a £ B + (r) and a —b , and 

• r — s otherwise. 

Intuitively, conditions ([I} and ([n]) reflect the fact that the applicability of a rule r depends 
on the applicability of a rule s with a head that unifies with a literal in the body of rule 
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r'i: swim(ind) V swim(outd) t— 

T 2 ■ need(inoutd, C) <— r 3 : goto(X) V ngoto(X) ■<— 

&rg[suwn](C') swim(P), location(P, X) 


r 4 : go <r- goto(X) 

n 

os■ ^ need(X, money) c§\ «— goto(X), goto(Y), X oj\ <— not go 

Fig. 3: Rule dependency graph of running example P SW im- 






r; condition ( pH] ) exists because r and s cannot be evaluated independently if they share 
a common head atom (e.g., u V v <— cannot be evaluated independently from v V w <—); 
and @ defines dependencies due to predicate inputs of external atoms. 

In the sequel, we let -^ m ,n = —> m U —> n be the union of monotonic and nonmonotonic 
rule dependencies. We next define graphs of rule dependencies. 

Definition 10 

Given a HEX-program P, the rule dependency graph DG(P) = (Vd, Ed) of P is the 
labeled graph with vertex set Vd = P and edge set Ed = —> m>n . 


Example 13 ( ctd .) 

Figure [3]depicts the rule dependency graph of our running example. According to Defini¬ 
tion [ 5 ] we have the following rule dependencies in P™-^: 

• due to (i) we have r 3 -A m n, r 4 -A m r 3 , c 6 -5> m r 3 , c 8 -S> m r 2 , and c 8 -5> m r 5 ; 

• due to (ii) we have C7 —r 4 ; 

• due to (iii) we have no dependencies; and 

• due to (iv) we have r 2 —> m n and r 5 — r 3 . 

Note that if we would omit the first argument of predicate need, we would have in addition 
r 2 —> m r,5 and r 2 due to (iii). Also note that &rq is monotonic (see Example [8). 


4.2 Splitting Sets and Theorems 


Splitting sets are a notion that allows for describing how a program can be decomposed into 
parts and how semantics of the overall program can be obtained from semantics of these 
parts in a divide-and-conquer manner. 

We lift the original HEX splitting theorem (|Eiter et al. 2006 Theorem 2) and the according 


definitions of global splitting set, global bottom, and global residual (Eiter et al. 2006 
Definitions 8 and 9) to our new definition of dependencies among rules. 

A rule splitting set is a part of a (non-ground) program that does not depend on the rest 
of the program. This corresponds in a sense with global splitting sets bylEiter et al. (2006 1 . 
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Definition 11 (Rule Splitting Set) 

A rule splitting set It for a HEX-program P is a set R C P of rules such that whenever 
r £ R, s £ P, and r — >m,n s, then s £ R holds. 

Example 14 ( ctd .) 

The following are some rule splitting sets of P SW i m ■ {tt}, {ri, r 2 }, {?T, r 3 }, {ri, r 2 , r 3 }, 
{ri, r 2 , r 3 , r§, eg}. The set R = {n, r 2 , eg} is not a rule splitting set, because eg r 5 

but r$ ^ R. 

Because of possible constraint duplication, we no longer partition the input program, and 
the customary notion of splitting set, bottom, and residual, is not appropriate for sharing 
constraints between bottom and residual. Instead, we next define a generalized bottom of 
a program, which splits a non-ground program into two parts which may share certain 
constraints. 

Definition 12 (Generalized Bottom) 

Given a rule splitting set R of a HEX-program P, a generalized bottom B of P wrt. R is 
a set B with R C B C P such that all rules in B \ R are constraints that do not depend 
nonmonotonically on any rule in P\ B. 

Example 15 (ctd.) 

A rule splitting set R of P SW im (e.g., those given in Example [T4 ] i is also a generalized 
bottom of P SW i m wrt. R. The set {r l5 r-i, eg} is not a rule splitting set, but it is a generalized 
bottom of P SW im wrt. the rule splitting set {ri, r 2 }, as eg is a constraint that depends only 
monotonically on rules in P sluim \ {ri, r 2 , eg}. 

Next, we describe how interpretations of a generalized bottom B of a program P lead to 
interpretations of P without re-evaluating rules in B. Intuitively, this is a relaxation of the 
previous non-ground HEX splitting theorem: a constraint may be put both in the bottom and 
in the residual if it has no nonmonotonic dependencies to the residual. The benefit of such 
constraint sharing is a smaller number of answer sets of the bottom, and hence of fewer 
evaluations of the residual program. 

Notation. For any set I of ground ordinary atoms, we denote by facts(I) the corresponding 
set of ground facts; furthermore, for any set P of rules, we denote by gh(P) the set of 
ground head atoms occurring in grnd(P). 

Theorem 1 (Splitting Theorem) 

Given a HEX-program P and a mle splitting set R of P, M £ AS(P) iff M £ AS(P \ R U 
facts(X)) with A' £ JS(R). 

Using the definition of generalized bottom, we generalize the above theorem. 

Theorem 2 (Generalized Splitting Theorem) 

Let P be a HEX-program, let R be a rule splitting set of P, and let B be a generalized 
bottom of P wrt. R. Then 

M £ AS(P) iff M £ AS(P \ R Li facts(X)) where A e JS(B). 
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Note that B \ R contains shareable constraints that are used twice in the Generalized 
Splitting Theorem, viz. in computing X and in computing M. 

The Generalized Splitting Theorem is useful for early elimination of answer sets of the 
bottom thanks to constraints which depend on it but also on rule heads outside the bottom. 
Such constraints can be shared between the bottom and the remaining program. 

Example 16 ( ctd .) 

We apply Theorems [T] and [2] to P SW im and compare them. Using the rule splitting set 
{ri,r 2 }, we can obtain AS(P sllnm ) by first computing AS({ri,r 2 }) = {h, h} where Ii = 
{ suuim(ind ), need(inoutd, money)}, I 2 = {swim(outd)}, and by then using Theorem^ 
X G AS(P swirn ) iff it holds that X G AS({r 3 , r 4 , r 5 , c 6 , C 7 , c 8 } U facts (J 1 )) or X G 
AS({r 3 , r 4 , r§, Cq, C 7 , c§} U /acts(/ 2 )). Note that the computation with I\ yields no answer 
set, as need(inoutd, money) G /1 satisfies the body of c 8 and ‘kills’ any model candidate. 
In contrast, if we use the generalized bottom {r j, r 2 , c 8 }, we have AS({n, r 2 , c 8 }) = 
{{ swim(outd )}} and can use Theorem[2]to obtain AS (P S wim) with only one further answer 
set computation: A G AS(P swim ) iff X G AS({r 3 ,r4,r 5 ,c 6 , C7,c s }U{swim(outd) «— }). 
Note that we use Cg in both computations, i.e., eg is shared between the generalized bottom 
and the remaining computation. 

Armed with the results of this section, we proceed to program evaluation in the next 
section. A discussion of the new splitting theorems that compares them to previous related 
theorems and argues for their advantage is given in Section [TT] 


5 Decomposition and Evaluation Techniques 


We now introduce our new HEX evaluation framework, which is based on selections of sets 
of rules of a program that we call evaluation units (or briefly units). 

The traditional HEX evaluation algorithm fEiter et al. 2006 1 uses a dependency graph 
over (non-ground) atoms, and gradually evaluates sets of rules (the ‘bottoms’ of a program) 
that are chosen based on this graph. In contrast our new evaluation algorithm exploits the 
rule-based modularity results for HEX-programs in Section [4] 

While previously a constraint can only kill models once all its dependencies on rules 
are fulfilled, the new algorithm increases evaluation efficiency by sharing non-ground 
constraints, such that they may kill models earlier; this is safe if all their nonmonotonic 
dependencies are fulfilled. Moreover, units no longer must be maximal. Instead, we require 
that partial models of units, i.e., atoms in heads of their rules, do not interfere with those of 
other units. This allows for independence, efficient storage, and easy composition of partial 
models of distinct units. 

In the following, we first define a decomposition of a HEX-program into evaluation units 
that are organized in an evaluation graph (Section |5. 1 1 >. Then we define an interpretation 
graph which contains input and output interpretations of each evaluation unit (Section [5~2} . 
We next extend this definition to answer set graphs which are related with answer sets of 
the program (Section |5.3| l. Finally Section 5.4 uses these definitions in an algorithm for 
enumerating answer sets of the HEX-program. 
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5.1 Evaluation Graph 


Using rule dependencies, we next define the notion of evaluation graph on evaluation units. 
We then relate evaluation graphs to splitting sets ( Lifschitz and Turner 1994) and show how 
to use them to evaluate HEX-programs by evaluating units and combining the results. 

We define evaluation units as follows. 


Definition 13 

An evaluation unit (in short ‘unit’) is any lde-safe HEX-program. 

The formal definition of lde-safety (see |Appendix D| and |Eiter et al, (2014a) l) is not crucial 
here, merely the property that a unit has a finite grounding with the same answer sets as 
the original unit which can be effectively computed; lde-safe HEX-programs are the most 
general class of HEX-programs with this property and computational support. 

An important point of the notion of evaluation graph is that rule dependencies r —> x s lead 
to different edges, i.e., unit dependencies, depending on the dependency type x £ { n, m} 
and whether r resp. s is a constraint; constraints cannot (directly) make atoms true, hence 
they can be shared between units in certain cases, while sharing non-constraints could 
violate modularity. 

Given a rule r £ P and a set U of units, we denote by U\ r = {u £ U \ r £ u} the set of 
units that contain rule r. 

Definition 14 (Evaluation graph ) 

An evaluation graph 8 = ( U , E) of a program P is a directed acyclic graph whose vertices 
U are evaluation units and which fulfills the following properties: 

(a) P = |J ue v u, i.e., every rule r £ P is contained in at least one unit; 

(b) every non-constraint r £ P is contained in exactly one unit, i.e., |(7| r | = 1; 

(c) for each nonmonotonic dependency r —> n s between rules r, s £ P and for all 
u £ U\ r , v £ U\ s , u v, there exists an edge ( u , v) £ E (intuitively, nonmonotonic 
dependencies between rules have corresponding edges everywhere in 8); and 

(d) for each monotonic dependency r —> m s between rules r, s £ P, there exists some 
u £ U\ r such that E contains all edges (u, v ) with v £ U\ s for v j- u (intuitively, for 
each rule r there is (at least) one unit in 8 where all monotonic dependencies from r to 
other rules have corresponding outgoing edges in 8). 

We remark that [Eiter et al. (201 1| > and [Schuller (2012) defined evaluation units as 
extended pre-groundable HEX-programs; later, |Redl (2014| and [Eiter et al. (2014a) ) defined 
generalized evaluation units as lde-safe HEX-programs, which subsume extended pre- 
groundable HEX-programs, and generalized evaluation graphs on top as in Definition [l4| 
As more the grounding properties of units matter than the precise fragment, we dropped 
here ‘generalized’ to avoid complex terminology. 

As a non-constraint can occur only in a single unit, the above definition implies that all 
dependencies of non-constraints have corresponding edges in 8, which is formally expressed 
in the following proposition. 
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Fig. 4: Evaluation graph 8\ for running example HEX program P SW i m . 



Fig. 5: Evaluation graph 8 2 for running example HEX program P SW im- 


Proposition 1 

Let 8 = (U, E ) be an evaluation graph of a program P, and assume r -> mj „ s is a 
dependency between a non-constraint r £ P and a rule s £ P. Then {(u,v) \ u £ U\ r ,v £ 
U\ a } C E holds. 

Example 17 ( ctd '.) 

Figures|4]and[5]show two possible evaluation graphs for our running example. The evaluation 
graph 8 \ contains every rule of P SW i m in exactly one unit. In contrast, 82 contains eg both 
in U2 and in 1x4. Condition (JdJ of Definition [14] is particularly interesting for these two 
graphs; it is fulfilled as follows. Graph 8\ can be obtained by contracting rules in the rule 
dependency graph DG(P SW i m ) into units, i.e., 8\ is a (graph) minor of DG(P SW im) and 
therefore all rule dependencies are realized as unit dependencies and Conditions 0 and 0 
are satisfied. In contrast, S 2 is not a minor of D6'(P s , Ijrm ) because dependency c§ —> m r 5 
is not realized as a dependency from u 2 to ii 4 . Nonetheless, all dependencies from c§ are 
realized at u.\ and thus £ 2 conforms with condition <[d]>, which merely requires that rule 
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dependencies have edges corresponding to all monotonic rule dependencies at some unit of 
the evaluation graph. 

Evaluation graphs have the important property that partial models of evaluation units do 
not intersect, i.e., evaluation units do not mutually depend on each other. This is achieved 
by acyclicity and because rule dependencies are covered in the graph. 

In fact, due to acyclicity, mutually dependent rules of a program are contained in the 
same unit; thus each strongly connected component of the program’s dependency graph is 
fully contained in a single unit. Furthermore, a unit can have in its rule heads only atoms 
that do not unify with atoms in the rule heads of other units, as rules which have unifiable 
heads mutually depend on one another. This ensures that under any grounding, the following 
property holds. 

Proposition 2 (Disjoint unit outputs ) 

Let £ = (U, E) be an evaluation graph of a program P. Then for each distinct units 

Mi, m 2 £ U, it holds that gh{u\) IT gh(u 2 ) = 00 


Example 18 (ctd.) 

Figures[4]and[5]show for each unit which atoms can become true due to rule heads in them, 
denoted as ‘derived’ atoms. Observe that both graphs have strictly non-intersecting atoms 
in rule heads of distinct units. 

As units of evaluation graphs can be arbitrary lde-safe programs, we clearly have the 
following property. 

Proposition 3 

For every lde-safe HEX program P, some evaluation graph £ exists. 

Indeed, we can simply put P into a single unit to obtain a valid evaluation graph. Thus the 
HEX evaluation approach based on evaluation graphs is applicable to all domain-expansion 
safe HEX programs. 


5.1.1 Evaluation Graph Splitting 

We next show that units and their predecessors in an evaluation graph correspond to 
generalized bottoms. We then use this property to formulate an algorithm for unit-based, 
efficient evaluation of HEX-programs. 

Given an evaluation graph £ = ( U , E), we write u < w, if a path from uto w exists in £, 
and u < w if either u < w or u = w. 

For a unit u £ U, we denote by preds £ (u) = {v £ U | ( u , v) £ E} the set of units on 
which u (directly) depends and by u K = LUet/ u<w w '^ le set °f ru l es i n all units on which 
u transitively depends; furthermore, we let u- = u < U u. Note that for a leaf unit u (i.e., u 
has no predecessors) we have preds £ (u) = m < = 0 and u- = u. 


6 See page|l6|for the definition of notation gh{P). 
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fai(b) = {e} 


fai(a) = {d, e} 


Fig. 6 : First Ancestor Intersection units (FAIs) in an evaluation graph. 


Theorem 3 

For every evaluation graph £ = (U,E) of a HEX-program Q and unit u £ U, it holds that 
m < is a generalized bottom of u- wrt. R= {r £ u K \ H(r) ^ 0}. 

Example 19 ( ctd '.) 

In £\, uf = u\ and uf = u.-| Uz / 2 and uf is a generalized bottom of uf wrt. R = 
{r 1; r 3 , r 4 }. In £ 2 , we have uf = u 3 U zz 2 U u 3 and uf = P SW i m and uf is a generalized 
bottom of P SW i m wrt. R = {ri, r 2 , r 3 , r 4 }. We can verify this on Definition [l2j we have 
P = Pswirm B = uf = {r*i, r 2 , r 3 , 7 " 4 , c 6 , C 7 , c 8 }, and R as above. Then RCBCP , 
and furthermore B \ R = { c ( ;, C 7 , c s } consists of constraints none of which depends 
nonmonotonically on a rule in P \ B = {r 5 }. 

Theorem 4 

Let £ = ( U , E) be an evaluation graph of a HEX-program Q and u £ U. Then for every 
unit v! £ preds £ (u), it holds that u'~ is a generalized bottom of the subprogram iP wrt. 
the rule splitting set R = {r £ u'~ \ H(r) 0}. 

Example 20 (ctd.) 

In £\, we have u\ £ preds £l (t/ 2 ); hence zip = iti is by Theorem[4]a generalized bottom of 
ulf = u\ wrt. R = {n, r 3 , r 4 }. Furthermore, 112 £ preds £l (u 3 ) and hence uf = u 3 U zx 2 
is a generalized bottom of uf = u± UU 2 wrt. R = {r 3 , r 2 , rg, r±, r 3 }. Thecaseof £ 2 andzt 4 
is less clear. We have u 2 £ preds £2 (ZZ 4 ), thus by Theorem kmf = zti U zz 2 = {ri, r 2 , Cs} 
is a generalized bottom of uf = ziiUzi 2 Uzi 3 wrt. R = {ri,r 2 }. Comparing against 
Definition [|2j we have P = u\Uu 2 Uu 3 and B = ziiUzi 2 ; thus indeed RCBCP 
and no constraint in B \ R = {cs} depends nonmonotonically on any rule in P\B = 

{f3,f4,C 6 ,C 7 }. 


5.1.2 First Ancestor Intersection Units 

We will use the evaluation graph for model building; as syntactic dependencies reflect 
semantic dependencies between units, multiple paths between units require attention. Of 
particular importance are first ancestor intersection units, which are units where distinct 
paths starting at some unit meet first. More formally. 
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Definition 15 

Given an evaluation graph £ = ( U, E ) and units v f w G U, we say that unit w is a first 
ancestor intersection unit (FAI) of v, if paths ji \ / p 2 from v to w exist in E that overlap 
only in v and w. By fai(v) we denote the set of all FAIs of v. 


Example 21 

Figure [6] sketches an evaluation graph with dependencies a —> b —> c—> e —> f, a —> d —> 
e —> g, and b — > d. We have that fai(a ) = {d, e}, fai(b) = {e}, and fai(u ) = 0 for each 
u £ U \ {a, 6 }. In particular, / and g are not FAIs of b, because all pairs of distinct paths 
from b to / or g overlap in more than two units. 

Note that for tree-shaped evaluation graphs, fai{v) = 0 for each unit v as paths between 
nodes in a tree are unique. 


Example 22 ( ctd .) 

The evaluation graph £\ of P SW im is a tree (see Fig.[4]>, thus/ai(w) = 0 for u £ {iti, u 2 , W 3 }. 
In contrast, the evaluation graph £ 2 of P S wim (see Fig. [5} is not a tree; we have that 
fai{uf) = {rti} and no other unit in £ 2 has FAIs. 


We can build an evaluation graph £ for a program P based on the dependency graph 
DG(P). Initially, the units are set to the maximal strongly connected components of 
DG(P), and then units are iteratively merged while preserving acyclicity and the conditions 
(a)-(d) of an evaluation graph; we will discuss some existing heuristics in Section 6.2 while 
for details we refer to jRedl (2014j ). 


5.2 Interpretation Graph 

We now define the Interpretation Graph (short i-graph), which is the foundation of our 
model building algorithm. An i-graph is a labeled directed graph defined wrt. an evaluation 
graph, where each vertex is associated with a specific evaluation unit, a type (input resp. 
output interpretation) and a set of ground atoms. 

We do not use interpretations themselves as vertices, as distinct vertices may be associated 
with the same interpretation; still we call vertices of the i-graph interpretations. 

Towards defining i-graphs we first define an auxiliary concept called interpretation 
structure. We then define i-graphs as the subset of interpretation structures that obey certain 
topological and uniqueness conditions. Finally we present an example (Example |23| and 
Figure [ 8 }. 

Definition 16 (Interpretation Structure ) 

Let £ = ( U, E) be an evaluation graph for a program P. An interpretation structure X 
for £ is a directed acyclic graph X = (M, F, unit, type, int) with nodes M C I id from 
a countable set X u j of identifiers, edges F C M x M, and total node labeling functions 

unit: M —>U, type: M —t { 1 , o}, and int: M — > 2 HBp . 

The following notation will be useful. Given unit u £ U in the evaluation graph associated 
with an i-graphZ, we denote by i-intsx{u ) = {m £ M \ unit(m) = u and type(m) = 1 } the 
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input (i-interpretations, and by o-intsx(u) = {to 6 M \ unit(m) = u and type{m ) = 0 } 
the output (o-)interpretations of I at unit u. For every vertex m £ M, we denote by 

int + (m ) = int(m ) U | m £ M and m! is reachable from m in 1} 

the expanded interpretation of to. 

Given an interpretation structure X = (M, F, unit , type , inf) for £ = (U, E ) and a unit 
u G U, we define the following properties: 

(IG-I) I-connectedness: for every to G o-intsx(u), it holds that |{ m' | (to, to') G F}\ = 

|{to' G i-intsz(u ) | (to, to') G F}\ = 1; 

(IG-O) O-connectedness: for every to G i-intsi(u), |{m, | (to, to*) G F}| = |preds f (w)| 
and for every m G preds £ (u) we have ({to* G o-intsx(ui) | (to, to*) G F}| = 1; 

(IG-F) M/ intersection: let £' be the subgraph of £ on the units reachable from iQand for 
every to G i-intsx(u), let I' be the subgraph of X reachable from to. Then X' contains 
exactly one o-interpretation at each unit of £'. (Note that both X and £ are acyclic, 
hence X' does not include m and £' does not include u.) 

(IG-U) Uniqueness: for every toi m 2 G M such that unit{m\) = unitiinx) = it, we 
have int + {m\) 7 ^ int + (m 2 ) (the expanded interpretations differ). 


Definition 17 (Interpretation Graph) 

Let £ = (U, E) be an evaluation graph for a program P. then an interpretation graph 
(i-graph) for £ is an interpretation structure X = (M, F, unit , type, int) that fulfills for 
every unit u G U the conditions (IG-I), (IG-O), (IG-F), and (IG-U). 

Intuitively, the conditions make every i-graph ‘live’ on its associated evaluation graph: an 
i-interpretation must conform to all dependencies of the unit it belongs to, by depending 
on exactly one o-interpretation at that unit’s predecessor units (IG-O); moreover an o-inter- 
pretation must depend on exactly one i-interpretation at the same unit (IG-I). Furthermore, 
every i-interpretation depends directly or indirectly on exactly one o-interpretation at each 
unit it can reach in the i-graph (IG-F); this ensures that no expanded interpretation int + (m) 
‘mixes’ two or more i-interpretations resp. o-interpretations from the same unit. (The effect 
of condition (IG-F) is visualized in Figure [7]) Finally, redundancies in an i-graph are ruled 
out by the uniqueness condition (IG-U). 


Example 23 ( ctd.) 

Figure [ 8 ] shows an interpretation graph X 2 for £ 2 . The unit label is depicted as dashed 
rectangle labeled with the respective unit. The type label is indicated after interpretation 
names, i.e., toi/i denotes that interpretation toi is an input interpretation. For X 2 the set 
lid of identifiers is {mi,..., m.15}. The symbol / in a unit u pointing to an i-interpretation 
to indicates that there is no o-interpretation wrt. input to of unit u. Section 5.4 describes an 
algorithm for building an i-graph given an evaluation graph. 

Dependencies are shown as arrows between interpretations. Observe that I-connectedness 
(IG-I) is fulfilled, as every o-interpretation depends on exactly one i-interpretation at the 
same unit. For example mg and inm depend on m. 7 . O-connectedness (IG-O) is similarly 


7 I.e., £' is the subgraph of £ induced by the set of units reachable from u, including v/; in abuse of terminology, 
we briefly say 'the subgraph (of £) reachable from’ 
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unit 

—1> unit dependency 
O i-interpretation 
0 o-interpretation 
—► dependency 




Fig. 7: Interpretation Graphs: violation of the FAI condition on the left, correct situation on 
the right. 



Fig. 8 : Interpretation graph 1-2 for Ey dashed areas group interpretations according to their 
unit(-) value. 


fulfilled, in particular consider i-interpretations of u 4 in 'Ey v,\ has two predecessor units 
( U 2 and U3) and every i-interpretation at u 4 depends on exactly one o-interpretation at n-> 
and exactly one o-interpretation at 113. The condition on FAI intersection (IG-F) could 
only be violated by i-interpretations at U4, concretely it would be violated if two different 
o-interpretations are reachable at U\ from one i-interpretation at 114. We can verify that 
from both TO 13 and 17144 we can reach exactly one o-interpretation at each unit; hence the 
condition is fulfilled. An example for a violation would be an i-interpretation at 114 that 
depends on itiq and mg: in this case we could reach two distinct o-interpretations m 2 and 
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m 3 at «i, thereby violating (1G-F). Uniqueness (IG-U) is satisfied, as in both graphs no unit 
has two output models with the same content. 

Note that the empty graph is an i-graph. This is by intent, as our model building algorithm 
will progress from an empty i-graph to one with interpretations at every unit, precisely if 
the program has an answer set. 


5.2.1 Join 

We will build i-graphs by adding one vertex at a time, always preserving the i-graph 
conditions. Adding an o-interpretation requires to add a dependency to one i-interpretation 
at the same unit. Adding an i-interpretation similarly requires addition of dependencies. 
However this is more involved because condition (IG-F) could be violated. Therefore, we 
next define an operation that captures all necessary conditions. 

We call the combination of o-interpretations which yields an i-interpretation a ‘join’. 
Formally, the join operation ‘ tx ’ is defined as follows. 

Definition 18 

Let I = (M , F, unit, type, int ) be an i-graph for an evaluation graph 8 = (V, E) of a 
program P. Let u £ V be a unit, let preds £ (u ) = {mi, ..., m^} be the predecessor units 
of m, and let to, £ o-intsz(u.i), 1 < i < k, be an o-interpretation at Mj. Then the join 
mi tx] • • • Ixi rn.fc = Ui<i<fc int(mi) at u is defined iff for each u' £ fai(u) the set of 
o-interpretations at u' that are reachable (in F) from some o-interpretation to,, 1 < i < 
contains exactly one o-interpretation m' £ o-intsi(u'). 

Intuitively, a set of interpretations can only be joined if all interpretations depend on the 
same (and on a single) interpretation at every unit. 

Example 24 ( ctd '.) 

In I 2 , i-interpretations m\, TO 4 , TO5, m.7, and to§ are created by trivial join operations with 
none or one predecessor unit. For 77113 and TO14, we have a nontrivial join: ot((to 13 ) = 
int(mf) U int (mu) and the join is defined because fai(uf) = {tti}, and from and 
TOn we can reach in I 2 exactly one o-interpretation at ui. Observe that the join m 6 tx] 7779 
is not defined, as we can reach in I 2 from {to 6 , m. 9 } the o-interpretations m 2 and to 3 at 
Mi, and thus more than exactly one o-interpretation at some FAI of M4. Similarly, the join 
me tx] ?7iio is undefined, as we can reach m 2 and m 3 at ui. 

The result of a join is the union of predecessor interpretations; this is important for answer 
set graphs and join operations on them, which comes next. Note that each leaf unit (i.e., 
without predecessors) has exactly one well-defined join result, viz. 0 . 

If we add a new i-interpretation from the result of a join operation to an i-graph and 
dependencies to all participating o-interpretations, the resulting graph is again an i-graph; 
thus the join is sound wrt. to the i-graph properties. Moreover, each i-interpretation that can 
be added to an i-graph while preserving the i-graph conditions can be synthesized by a join; 
that is, the join is complete for such additions. This is a consequence of the following result. 
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Proposition 4 

Let I = ( M, F, unit, type, int) be an i-graph for an evaluation graph £ = (V,E) and u £ V 
with preds £ (u ) = { u ±,..., u *,}. Furthermore, let m^ £ o-intsx{ui), 1 < i < k, such that 
no vertex m £ i-intsx{u) exists such that {(to, mi),..., (m, to*,)} C F. Then the 
join J = mi ixi • • • x TO* is defined at u iff X' = ( M',F', unit', type', int') is an i- 
graph for £ where (a) M' = M U {to 7 } for some new vertex to 7 elid \ M, (b) F 7 = 
F U {(to 7 , mt) | 1 < i < k}, (c) unit' = unit U {(to 7 , it)}, (d) type' = type U {(to 7 , i)}, 
and (e) int' = int U {(to 7 , J)}. 

Note that the i-graph definition specifies topological properties of an i-graph wrt. an eval¬ 
uation graph. In the following we extend this specification to the contents of interpretations. 

5.3 Answer Set Graph 

We next restrict i-graphs to answer set graphs such that interpretations correspond with 
answer sets of certain HEX programs that are induced by the evaluation graph. 

Definition 19 (Answer Set Graph) 

An answer set graph A = ( M , F, unit, type, int) for an evaluation graph 8 = ( U, E) is an 
i-graph for 8 such that for each unit u £ U, it holds that 

(a) {int + {m) \ m £ i-intsx(u)} C AS(u < ), i.e., every expanded i-interpretation at u is 
an answer set of u < \ 

(b) {int + {m) \ m £ o-intsx{u)} C AS(u-), i.e., every expanded o-interpretation at u is 
an answer set of u -; and 

(c) for each to £ i-intsx(u), it holds that int(m) = (J^ m m .\ eF int(mi). 

Note that each leaf unit u, has ii < = 0, and thus 0 is the only i-interpretation possible. 
Moreover, condition 0 is necessary to ensure that an i-interpretation at unit u contains all 
atoms of answer sets of predecessor units that are relevant for evaluating u. Furthermore, 
note that the empty graph is an answer set graph. 

Example 25 ( ctd .) 

The example i-graph I 2 is in fact an answer set graph. First, int + {m\) = 0 and ufi 0 
and indeed 0 £ _4S(0) which satisfies condition 0- Less obvious is the case of o-interpre- 
tation me in I 2 ' int + {ine) = {swim(outd)} and itf = {ri, 7 - 2 , cs}; as c$ kills all answer 
sets where money is required, -4S({ri, 7 - 2 , cs}) = {{swim(outd)}}', hence int + (me) is 
the only expanded interpretation of an o-interpretation possible at 112 . Furthermore, the 
condition (IG-U) on i-graphs implies that me is the only possible o-interpretation at U 2 - 
Consider next TO 13 : 

uf = {r 1 ,r 2 ,r 3 ,r 4 ,c 6 ,c 7 ,c 8 } and 
int + (mi 3 ) = {go, goto(altD), ngoto{gansD), swim{outd){. 

The two answer sets of U 4 are {go, goto(altD), ngoto(gansD), swim(outd)}, and {go, 
goto{gansD), ngoto{altD), swim{outd)}, and int + [rriifi) is one of them; the other one is 
int + {mifi). Finally 

int + (mie) = {swim(outd), goto(altD), go, ngoto{gansD), need(loc, yogamat)}, 

which is the single answer set of uf = P SW im- 
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Similarly as for i-graphs, the join is a sound and complete operation to add i-interpreta- 
tions to an answer set graph. 

Proposition 5 

Let A = (M , F, unit, type, int) be an answer set graph for an evaluation graph £ = (V, E) 
and let u £ V with preds £ (u) = {ui,... ,Uk}- Furthermore, let mi £ o-intsj.(ui), 
1 < i < k, such that no m £ i-ints^(u) with {(to, toi), ..., (to, ink)} C F exists. Then 
the join J = Toi ix • • • cxi ink is defined at u iff A' = {M ', F', unit', type', int') is an 
answer set graph for £ where (a) M' = M U {to'} for some new vertex m! £lid \ M, 
(b) F' = F U {(to', TOj) | 1 < i < k}, (c) unit ' = unit U {(to', u)}, (d) type' = type U 
{(to', i)}, and (e) int' = int U {(to', J)}. 


Example 26 ( ctd .) 

Imagine that I 2 has no interpretations at 144 . The following candidate pairs of o-inter- 
pretations exist for creating i-interpretations at up. me txi TOg, me ix toio, me>timii, 
and me ix TO12. A seen in Example [24] TO13 = me 1 x 1 TOn and TO14 = me 1 x 1 mi2 are 
the only joins at U 4 that are defined. In Example 25 we have seen that AS (uf ) = 
{mi + (77113), int + (777,14) }, and due to (IG-U), we cannot have additional i-interpretations 
with the same content. 


5.3.1 Complete Answer Set Graphs 
We next introduce a notion of completeness for answer set graphs. 

Definition 20 

Let A = (M, F, unit, type, int) be an answer set graph for an evaluation graph £ = (U, E) 

and let u £ U. Then 

• A is input-complete for u, if {int + (m) | to £ i-ints^(u)} = AS(m < ), and 

• A is output-complete for u, if {int + (m) \ to £ o-ints^u)} = AS(u~). 

If an answer set graph is complete for all units of its corresponding evaluation graph, 
answer sets of the associated program can be obtained as follows. 

Theorem 5 

Let £ = ( U, E), where U = {ui,..., u„ } , be an evaluation graph of a program P, and let 
A = ( M , F, unit, type, int) be an answer set graph that is output-complete for every unit 
u £ U. Then 

AS(P) = {UIL .j jnl(mj) | mi £ o-intSj{(ui), 1 <i< n,\o-ints^(ui)\ = lj, (4) 

where Al is the subgraph of A consisting of all interpretations that are reachable in A from 
some interpretation mi,..., m n . 


Example 27 (ctd.) 

In I 2 we first choose TO 15 £ o-ints{uf), which is the only o-interpretation at U 4 . The 
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subgraph reachable from to 15 must contain exactly one o-interpretation at each unit; we 
thus must choose every o-interpretations m such that rn-\ 5 —> + m. Hence we obtain 

{mt(TO 3 ) U int(mo) U int(rri \\) U int(m\ 5 )} 

= {{swim(outd)} U 0 U { goto(altD ), ngoto(gansD), go} LI { need(loc, yogamat)}} 
= {{ swim(outd ), goto(altD), ngoto(gansD), go, need(loc, yogamat)}} 

which is indeed the set of answer sets of P sw im- 

The rather involved set construction in ([4]» establishes a relationship between answer sets 
of a program and complete answer set graphs that resembles condition (IG-F) of i-graphs. 
To obtain a more convenient way to enumerate answer sets, we can extend an evaluation 
graph always with a single void unit Ufi na i that depends on all other units in the graph (i.e., 
(u final, u) G E for each u £ U \ {ufi na i}), which we call ufinai unit ; the answer sets of P 
correspond then directly to i-interpretations at Ufi na i- Formally, 

Proposition 6 

Let A = (M, F, unit, type, int ) be an answer set graph for an evaluation graph £ = (U, E) 
of a program P, where £ contains a final unit Ufi na i, and assume that A is input-complete 
for U and output-complete for U \ { Ufinai. \ ■ Then 

AS{P) = {int{m) | to G i-ints A (ufi na i)}. (5) 

Expanding i-interpretations at Ufi na i is not necessary, as Ufi na i depends on all other units; 
thus for every to G i-ints^(ufi na i) it holds that int + (m) = int(m). 

We will use the technique with Ufi na i for our model enumeration algorithm; as the join 
condition must be checked anyways, this technique is an efficient and simple method for 
obtaining all answer sets of a program using an answer set graph without requesting an 
implementation of the conditions in Theorem[5] 

5.4 Answer Set Building 

Thanks to the results above, we can obtain the answer sets of a HEX-program from any 
answer set graph for it. To build an answer set graph, we proceed as follows. We start with 
an empty graph, obtain o-interpretations by evaluating a unit on an i-interpretation, and then 
gradually generate i-interpretations by joining o-interpretations of predecessor units in an 
evaluation graph at hand. 

Towards an algorithm for evaluating a HEX-program based on an evaluation graph, we 
use a generic grounding algorithm GROUNDHEX for lde-safe programs, and a solving 
algorithm EVALUATEGROUNDHEX which returns for a ground HEX-program P its answer 
sets AS(P). We assume that they satisfy the following properties. 

Property 1 

Given an lde-safe program P, GroundHEX(P) returns a finite ground program such that 
AS{P) = AS(GroundHEX(P)). 

Property 2 

Given a finite ground HEX-program P, EvaluateGroundHEX(P) = AS(P). 
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Algorithm 1: EvaluateLDESafe 


Input: A liberally de-safe HEX-program P, an input interpretation / 
Output: All answer sets of P U facts(I) without I 

// add input facts and ground, cf. 

P' <- GroundHEX(P U facts(I)) 

// evaluate the ground program, cf 
// and perform output projection 


(Eiter et al. 

2014a 

) 





( 

Eiter et al 

2014b) 


return {P \ I \ I' e EvaluateGroundHEX(P')} 


Concrete such algorithms are given in ( Eiter et al. 2014a| l and ( Eiter et al. 2014b| l, respec¬ 
tively. Since the details of these algorithms are not relevant for the further understanding of 
this paper, we give here only an informal description and refer the interested reader to the 
respective papers. The idea of the grounding algorithm is to iteratively extend the grounding 
by expanding the set of constants until it is large enough to ensure that it has the same 
answer sets as the original program. To this end, the algorithm starts with the constants in 
the input program only, and in each iteration of the algorithm it evaluates external atoms a 
(finite) number of relevant inputs in order to determine additional relevant constants. Under 
the syntactic restrictions recapitulated in the preliminaries, this iteration will reach a fixpoint 
after finitely many steps. The solving algorithm is based on conflict-driven clause learning 
(CDCL) and lifts the work of |Gebser et al. (20l2| from ordinary to HEX programs. The main 
idea is to learn not only conflict clauses, but also (parts of) the behavior of external sources 
while the search space is traversed. The behavior is described in terms of input-output 
relations, i.e., certain input atoms and constants lead to a certain output of the external atom. 
This information is added to the internal representation of the program such that guesses for 
external atoms that violate the known behavior are eliminated in advance. 

By composing the two algorithms, we obtain Algorithm[I|for evaluating a single unit. 
Formally, it has the following property. 


Proposition 7 

Given an lde-safe HEX-program P and an input interpretation /, Algorithm [T]returns the set 
{I' \ I | P € AS(P U facts(I))}, i.e., the answer sets of P augmented with facts for the 
input /, projected to the non-input. 


We are now ready to formulate an algorithm for evaluating HEX programs that have been 
decomposed into an evaluation graph. 

To this end, we build first an evaluation graph £ and then compute gradually an answer 
set graph A = ( M, F, unit, type, ini) based on £, proceeding along already evaluated units 
towards the unit Ufi na i. Algorithm [2] shows the model building algorithm in pseudo-code, 
in which the positive integers M = {1,2,...} are used as identifiers I l( i and max (AT) is 
maximum in any set M C N where, by convention, max(0) = 0. Intuitively, the algorithm 
works as follows. The set U contains units for which A is not yet output-complete (see 
Definition |20[>; we start with an empty answer set graph A, thus initially U = V. In each 
iteration of the while loop |(a)| a unit u that is not output-complete and depends only on 
output-complete units is selected. The first for loop |(c)| makes u input-complete; if u is the 
final unit, the answer sets are returned in |(d )[ otherwise the second for loop|(e)|makes u 
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Algorithm 2: BuildAnswerSets 

Input: £ = (V. E): evaluation graph for HEX program P, which contains a unit Ufi na i 
that depends on all other units in V 
Output: a set of all answer sets of P 
M := 0, F := 0, unit := 0, type := 0, int := 0, U := V 


(a) while U 0 do 

choose u £ U s.t. precis £ (u) D U = 0 
let {ui,..., Uk} = preds £ (u) 


(b) 


if k = 0 then 

to := max(M) + 1 
M := M U {to} 

unit(m) := u, type(m) := I, int(m ) := 0 


else 


(c) 


for toi £ o-ints(ui ), ..., m.k £ o-ints(uk) do 
if J = toi cxi • • ■ M TOfe A defined then 
to := max(M) + 1 

M ■= M U {to}, F := F U {(to, ?n,;) | 1 < i < fc} 
unit(m) := u, type(m) := I, int{m) := J 


(d) 

(e) 


(f) 


if u — Cj,,,,, i then 
|_ return i-ints(ufi na i) 

for m' £ i-ints(u ) do 

O := EvaluateLDESafe(u, int(m')) 
for o £ O do 

to- := max(M) + 1 
M := M U {to}, F := F U {(to,?tt')} 
unit(m) := u, type{m) := O, int(m ) := o 

C/:=C/\{w} 


output-complete, and then u is removed from U. Each iteration makes one unit input- and 
output-complete; hence when the algorithm reaches Ufi na i and makes it input-complete, all 
answer sets can directly be returned in (d) Formally, we have 


Theorem 6 

Given an evaluation graph £ = ( V, F) of a HEX program P, BuildAnswerSets(£) 
returns .4S(P). 


A run of the algorithm on our running example using the evaluation graph £2 extended 


with a final unit is given in Appendix B 


5.4.1 Model Streaming 

Algorithm BuildAnswerSets as described above keeps all answer sets in memory, and 
it evaluates each unit only once wrt. every possible i-interpretation. This may lead to a 
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resource bound excess, as in general an exponential number of answer sets respectively 
interpretations at evaluation units are possible. However, keeping the whole answer set 
graph in memory is not necessary for computing all answer sets. 

We have realized a variant of Algorithm BuildAnswerSets that uses the same princi¬ 
ple of constructing an answer set graph, interpretations are created at a unit on demand when 
they are requested by units that depend on it; furthermore, the algorithm keeps basically 
only one interpretation at each evaluation unit in memory at a time, which means that 
interpretations are provided in a streaming fashion one by one, and likewise the answer 
sets of the program at the unit Ufi na i, where the model building starts. Such answer set 
streaming is particularly attractive for applications, as one can terminate the computation 
after obtaining sufficiently many answer sets. On the other hand, it comes at the cost of 
potential re-evaluation of units wrt. the same i-interpretation, as we need to trade space 
for time. However, in practice this algorithm works well and is the one used in the dlvhex 


prototype. We describe this algorithm in Appendix C 


6 Implementation 

In this section we give some details on the implementation of the techniques. Our prototype 
system is called dlvhex; it is written in C++ and online available as open-source software]^] 
The current version 2.4.0 was released in September 2014. 

We first describe the general architecture, the major components, and their interplay 
(Section|6Tj. Then we give an overview about the existing heuristics for building evaluation 
graphs (Section |fx2] |. Experimental results are presented and discussed in Section 6.3 For 
details on the usage of the system, we refer to the website; an exhaustive description of the 
supported command-line parameters is output when the system is called without parameters. 


6.1 System Architecture 

The dlvhex system architecture is shown in Figure [9] The arcs model both control and data 
flow within the system. The evaluation of a HEX-program works as follows. 

First, the input program is passed to the evaluation framework Q, which creates an 
evaluation graph depending on the chosen evaluation heuristics. This results in a number 
of interconnected evaluation units. While the interplay of the units is managed by the 
evaluation framework, the individual units are handled by model generators of different 
kinds. 

Each instance of a model generator realizes EvaluateFDESafe (Algorithm[T]> for a 
single evaluation unit, receives input interpretations from the framework (which are either 
output by predecessor units or come from the input facts for leaf units), and sends output 
interpretations back to the framework ©, which manages the integration of the latter to 
final answer sets and realizes BuildAnswerSets (Algorithm^. 

Internally, the model generators make use of a grounder and a solver for ordinary ASP 
programs. The architecture of our system is flexible and supports multiple concrete backends 
that can be plugged in. Currently it supports dlv, gringo 4.4.0 and clasp 3.1.0, as well as an 

8 http://www.kr.tuwien.ac.at/research/systems/dlvhex 
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internal grounder and a solver that were built from scratch (mainly for testing purposes); they 
use basically the same core algorithms as gringo and clasp, but without optimizations. The 
reasoner backends gringo and clasp are statically linked to our system; thus no interprocess 
communication is necessary. The model generator within the dlvhex core sends a non-ground 
evaluation unit to the HEX-grounder which returns a ground evaluation unit ©. The HEX- 
grounder in turn uses one of the above mentioned ordinary ASP grounders as backend © 
and accesses external sources to handle newly introduced constants that are not part of the 
input program (called value invention) ©. The ground evaluation unit is then sent to the 
ASP solver and answer sets of the ground unit are returned ©. 

Intuitively, model generators evaluate evaluation units by replacing external atoms by 
ordinary ‘replacement’ atoms, guessing their truth value, and making sure that the guesses 
are correct with respect to the external oracle functions. To achieve that, the solver backend 
needs to make callbacks to the Post Propagator in the dlvhex core during model building. 
The Post Propagator checks guesses for external atoms against the actual semantics and 
checks the minimality of the answer set. It processes a complete or partial model candidate, 
and returns learned nogoods to the external solver © as formalized in (Eiter et al. 2012 1 . 
The dlv backend calls the Post Propagator only for complete model candidates, the internal 
solver and the clasp backend also call it for partial model candidates of evaluation units. 
For the clasp backend, we exploit its SMT interface, which was previously used for the 
special case of constraint answer set solving ( Gebser et al, 2009bj ). Verifying guesses of 
replacement atoms requires calling plugins that implement the external sources (i.e., the 
oracle functions F&g from Definition [3j ©. Moreover, the Post Propagator also ensures 
answer set minimality by eliminating unfounded sets that are caused by external sources 


and therefore can not be detected by the ordinary ASP solver backend (as shown by Eiter 


et al. (2014b l). Finally, as soon as the evaluation framework obtains an i-interpretation of 


the final evaluation unit Ufi na i, this i-interpretation (which is an answer set according to 
Proposition| 6 | is returned to the user ©. 


6.2 Heuristics 

As for creating evaluation graphs, several heuristics have been implemented. A heuristics 
starts with the rule dependency graph as by Definition [TO] and then acyclically combines 
nodes into units. 

Some heuristics are described in the following. 

HO is a ‘trivial’ heuristics that makes units as small as possible. This is useful for debugging, 
however it generates the largest possible number of evaluation units and therefore incurs a 
large overhead. As a consequence HO performs clearly worse than other heuristics and we 
do not report its performance in experimental results. 

HI is the evaluation heuristics of the dlvhex prototype version 1. HI makes units as large 
as possible and has several drawbacks as discussed above. 

H2 is a simple evaluation heuristics which has the goal of finding a compromise between 
the HO and HI. It places rules into units as follows: 

(i) it puts rules r 1; r 2 into the same unit whenever rq —» TO) „ s and r 2 s for some 

rule s and there is no rule t such that exactly one of r\ , r 2 depends on t; 
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Fig. 9: Architecture of dlvhex 


(ii) it puts rules n, 7~2 into the same unit whenever s —t m , n r 1 and s —} m ,n r '2 for some 
rule s and there is no rule t such that t depends on exactly one of rq. r' 2 ; but 

(iii) it never puts rules r, s into the same unit if r contains external atoms and r -^ m ,n s. 


Intuitively, H2 builds an evaluation graph that puts all rules with external atoms and their 
successors into one unit, while separating rules creating input for distinct external atoms. 
This avoids redundant computation and joining unrelated interpretations. 


H3 is a heuristics for finding a compromise between (1) minimizing the number of units, 
and (2) splitting the program whenever a de-relevant nonmonotonic external atom would 
receive input from the same unit. We mention this heuristics only as an example, but 
disregard it in the experiments since it was developed in connection with novel ‘liberal’ 
safety criteria ( jEiter et al, 2013j l that are beyond the scope of this paper. H3 greedily gives 
preference to (1) and is motivated by the following considerations. The grounding algorithm 
by [Eiter et al. (2014a] ) evaluates the external sources under all interpretations such that the 
set of observed constants is maximized. While monotonic and antimonotonic input atoms 
are not problematic (the algorithm can simply set all to true resp. false), nonmonotonic 
parameters require an exponential number of evaluations in general. Thus, although program 
decomposition is not strictly necessary for evaluating liberally safe HEX-programs, it is still 
useful in such cases as it restricts grounding to those interpretations that are actually relevant 
in some answer set. However, on the other hand it can be disadvantageous for propositional 
solving algorithms such as those in ( Eiter et al. 2012| . 

Program decomposition can be seen as a hybrid between traditional and lazy grounding 
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(cf. e.g. Palu et al. (2009| >), as program parts are instantiated that are larger than single rules 
but smaller than the whole program. 


6.3 Experimental Results 


In this section, we evaluate the model-building framework empirically. To this end, we 
compare the following configurations. In the HI column, we use the previous state-of-the-art 
evaluation method ( (Schindlauer 2006) before the framework in Section[5]was developed. 
This previous method also makes use of program decomposition. However, in contrast to 
our new framework, the decomposition is based on atom dependencies rather than rule 
dependencies, and the decomposition strategy is hard-coded and not customizable. This 
evaluation method corresponds to heuristics HI in our new framework. 

In the w/o framework column, we present the results without application of the framework 
using the HEX-program evaluation algorithm by Eiter et al. (2014aj ) which allows to first 
instantiate and then solve the instantiated HEX-program. Note that before this algorithm was 
developed, such a ‘two-phase’ evaluation was not possible since program decomposition was 
necessary for grounding purposes. With the algorithm in ( Eiter et al. 2014a] ), decomposition 
is not necessary anymore, but can still be useful as the results in the H2 column shows, 
which correspond to the results when applying the heuristics H2 described above. 

The configuration of the grounding algorithm and the solving algorithm (e.g., conflict- 
driven learning strategies) also influence the results. Moreover, in addition to the default 
heuristics of framework, other heuristics have been developed as well and the best selection 
of the heuristics often depends on the configuration of the grounding and the solving 
algorithm. Since they were used as black boxes in Algorithm [l] an exhaustive experimental 
analysis of the system is beyond the scope of this paper and would require an in-depth 
description of these algorithms. Thus, we confine the discussion to the default settings, which 
suffices to show that the new framework can speed up the evaluation significantly. The only 
configuration difference between the result columns HI and H2 is the evaluation heuristics, 
all other parameters are equal. Evaluating the w/o framework column requires the grounding 
algorithm from ( Eiter et al. 2014a) > instead of evaluation via decomposition, therefore w/o 
framework does not use any heuristics. The solver backend (clasp) configuration is the same 
in HI. H2, and w/o framework. We use the streaming algorithm (see Appendix Q in all 
experiments. For an in depth discussion, we refer to Eiter et al.|( 2014b| 2014aj) and Redl 
|(2014| ), where the efficiency was evaluated using a variety of applications including planning 
tasks (e.g., robots searching an unknown area for an object, tour planning), computing 
extensions of abstract argumentation frameworks, inconsistency analysis in multi-context 
systems, and reasoning over description logic knowledge bases. 

We discuss here two benchmark problems, which we evaluated on a Linux server with 
two 12-core AMD 6176 SE CPUs with 128GB RAM running dlvhex version 2.4.0. and an 
HTCondor load distribution systerrj^jthat ensures robust runtimes. The HTCondor system 
ensures that multiple runs of the same instance have negligible deviations in the order of 
fractions of a second, thus we can restrict the experiments to one run. The grounder and 
solver backends for all benchmarks are gringo 4.4.0 and clasp 3.1.1. For each instance, we 


9 http://research.cs.wisc.edu/htcondor 
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Topology and 

Instance Size 


First Answer Set 



All Answer Sets 


HI 

w/o framework 

H2 

HI 

w/o framework 

H2 

d-7-7-3-3 (10) 

1.23 (0) 

0.29 (0) 

0.38 (0) 

4.93 (0) 

0.76 (0) 

0.79 (0) 

d-7-7-4-4 (10) 

18.43 (0) 

1.09(0) 

0.76 (0) 

50.78 (0) 

3.39 (0) 

1.80 (0) 

d-7-7-5-5 (10) 

94.18(1) 

3.60 (0) 

1.52 (0) 

289.35 (4) 

20.21 (0) 

4.97 (0) 

h-9-9-3-3 (10) 

83.17(1) 

3.77 (0) 

0.70 (0) 

300.96 (4) 

28.67 (0) 

2.11 (0) 

h-9-9-4-4 (10) 

389.74 (6) 

30.56 (0) 

2.14(0) 

555.94 (9) 

335.11 (5) 

12.56 (0) 

r-7-7-4-4 (10) 

39.27 (0) 

2.82 (0) 

0.33 (0) 

366.17 (5) 

57.26 (0) 

2.06 (0) 

r-7-7-5-5 (10) 

389.88 (6) 

105.80(1) 

0.93 (0) 

600.00(10) 

377.37 (5) 

4.39 (0) 

r-7-8-5-5 (10) 

226.04 (3) 

25.11 (0) 

0.57 (0) 

541.80 (9) 

317.64(5) 

3.99 (0) 

r-7-9-5-5 (10) 

355.37 (5) 

145.99 (2) 

0.87 (0) 

600.00(10) 

458.14(7) 

5.42 (0) 

r-8-7-5-5 (10) 

502.64 (8) 

329.47 (5) 

1.21 (0) 

555.26 (9) 

443.15 (7) 

5.84 (0) 

r-8-8-5-5 (10) 

390.81 (6) 

201.08 (3) 

1.00 (0) 

600.00(10) 

495.41 (8) 

5.38 (0) 

z-7-7-3-3 (10) 

2.34 (0) 

0.32 (0) 

0.44 (0) 

9.17 (0) 

1.13(0) 

1.00(0) 

z-7-7-4-4 (10) 

33.32 (0) 

1.58 (0) 

1.07 (0) 

182.44 (2) 

9.00 (0) 

2.67 (0) 

z-7-7-5-5 (10) 

164.33 (2) 

12.69 (0) 

3.52 (0) 

502.49 (8) 

89.01 (1) 

6.90 (0) 


Table 1: MCS experiments: variable topology (d, h, r, z) and instance size. 


limited the CPU usage to two cores and 8GB RAM. The timeout for each instance was 600 
seconds. Each line shows the average runtimes over all instances of a certain size, where 
each timeout counts as 600 seconds. While instances usually become harder with larger 
size, there might be some exceptions due to the randomly generated instances; however, the 
overall trend shows that runtimes increase with the instance size. Numbers in parentheses 
are the numbers of instances of respective size in the leftmost column and the numbers of 
timeout instances elsewhere. The generators, instances and external sources are available at 
http://www.kr.tuwien.ac.at/research/projects/hexhex/hexframework 


6.3.1 Multi-Context Systems (MCS) 


The MCS benchmarks originate in the application scenario of enumerating output-projected 
equilibria (i.e., global models) of a given multi-context system (MCS) (cf. Section |2.3.2| i. 
Each instance comprises 7-9 contexts (propositional knowledge bases) whose local seman¬ 
tics is modeled by external atoms; roughly speaking, they single out assignments to the 
atoms of a context occurring in bridge rules such that local models exist. For each context, 
5-10 such atoms are guessed and bridge rules, which are modeled by ordinary rules, are 


randomly constructed on top. The MCS instances were generated using the DMCS (Bairak 


dar et al. 2010) instance generator, with 10 randomized instances for different link structure 
between contexts (diamond (d), house (h), ring (r), zig-zag (z)) and system size; they have 


between 4 and about 20,000 answer sets, with an average of 400. We refer to (Bairakdar 
et al. 2010) and I Schuller 2012j ) for more details on the benchmarks and the HEX-programs. 

Table |T| shows the experimental results: computation with the old method HI often 
exceeds the time limit, while the new method H2 manages to enumerate all solutions of all 
instances. Monolithic evaluation without decomposition shows a performance between the 
old and new method. These results show that our new evaluation method is essential for 
using HEX to computationally realize the MCS application. 
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Instance Size 

First Answer Set 


All Answer Sets 


HI 

w/o framework 

H2 

HI 

w/o framework 

H2 

HD 

2.84 (0) 

3.14(0) 

2.78 (0) 

2.73 (0) 

3.14(0) 

2.79 (0) 

2(1) 

6.13(0) 

7.18(0) 

4.90 (0) 

6.05 (0) 

7.17(0) 

4.88 (0) 

3(1) 

10.18(0) 

12.30 (0) 

8.32 (0) 

10.25 (0) 

12.35 (0) 

8.37 (0) 

4(1) 

15.92 (0) 

18.66 (0) 

12.12(0) 

15.86 (0) 

18.85 (0) 

12.16(0) 

5(1) 

26.06 (0) 

28.47 (0) 

17.17(0) 

26.23 (0) 

28.35 (0) 

17.06 (0) 

6(1) 

47.06 (0) 

45.71 (0) 

23.39 (0) 

46.84 (0) 

45.62 (0) 

23.26 (0) 

7(1) 

92.76 (0) 

79.41 (0) 

31.19(0) 

96.56 (0) 

79.82 (0) 

31.04(0) 

8(1) 

198.59 (0) 

155.10(0) 

37.85 (0) 

199.74 (0) 

155.26 (0) 

38.06 (0) 

9(1) 

600.00(1) 

600.00(1) 

46.61 (0) 

600.00(1) 

600.00(1) 

46.75 (0) 

10(1) 

600.00(1) 

600.00 (1) 

57.48 (0) 

600.00(1) 

600.00(1) 

57.40 (0) 

11(1) 

600.00(1) 

600.00 (1) 

68.98 (0) 

600.00(1) 

600.00(1) 

69.45 (0) 

12(1) 

600.00(1) 

600.00(1) 

84.41 (0) 

600.00(1) 

600.00(1) 

84.11 (0) 

13(1) 

600.00(1) 

600.00 (1) 

99.55 (0) 

600.00(1) 

600.00(1) 

99.52 (0) 

14(1) 

600.00(1) 

600.00 (1) 

117.39 (0) 

600.00(1) 

600.00(1) 

117.15(0) 

15(1) 

600.00(1) 

600.00 (1) 

138.45 (0) 

600.00(1) 

600.00(1) 

137.51 (0) 

16(1) 

600.00(1) 

600.00 (1) 

163.12(0) 

600.00(1) 

600.00(1) 

158.43 (0) 

17(1) 

600.00(1) 

600.00(1) 

184.99 (0) 

600.00(1) 

600.00(1) 

181.94(0) 

18(1) 

600.00(1) 

600.00 (1) 

208.83 (0) 

600.00(1) 

600.00(1) 

210.82 (0) 

19(1) 

600.00(1) 

600.00 (1) 

236.98 (0) 

600.00(1) 

600.00(1) 

237.45 (0) 

20(1) 

600.00(1) 

600.00 (1) 

267.54 (0) 

600.00(1) 

600.00(1) 

268.60 (0) 

21(1) 

600.00(1) 

600.00 (1) 

600.00(1) 

600.00(1) 

600.00(1) 

600.00(1) 


Table 2: RSTrack experiments: variable number of conference tracks, single answer set. 


6.3.2 Reviewer Selection (RS ) 

Our second benchmark is Reviewer Selection (RS): we represent c conference tracks, r 
reviewers and p papers. Papers and reviewers are assigned to conference tracks, and there 
are conflicts between reviewers and papers, some of which are given by external atoms. We 
consider two scenarios: RSTrack and RSPaper. They are designed to measure the effect 
of external atoms on the elimination of a large number of answer set candidates; in contrast 
to the MCS experiments we can control this aspect in the RS experiments. 

In RSTrack we vary the number c of conference tracks, where each track has 20 papers 
and 20 reviewers. Each paper must get two reviews, and no reviewer must get more than 
two papers. Conflicts are dense such that only one valid assignment exists per track, hence 
each instance has exactly one answer set, and in each track two conflicts are external. For 
each number c there is only one instance because RSTrack instances are not randomized. 
The results of RSTrack are shown in Table |2j runtimes of the old evaluation heuristics 
(HI ) grow fastest with size, without using decomposition grows slightly slower but also 
reaches timeout at size 9. Only the new decomposition (H2 heuristics) can deal with size 20 
without timeout. Finding the first answer set and enumerating all answer sets show very 
similar times, as RSTrack instances have a single answer set and finding it seems hard. 

In RSPaper we fix the number of tracks to c = 5; we vary the number p of papers in each 
track and set the number of reviewers to r=p. Each paper must get three reviews and each 
reviewer must not get more than three papers assigned. Conflicts are randomized and less 
dense than in RSTrack: the number of answer sets is greater than one and does not grow 
with the instance size. Over all tracks and papers, 2 p randomly chosen conflicts are external, 
and we generate 10 random instances per size and report results averaged per instance size 
in Table [3] As clearly seen, our new method is always faster than the other methods, and 
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Instance Size 

First Answer Set 



All Answer Sets 



HI 

w/o framework 

H2 

HI 

w/o framework 

H2 


5(10) 

1.06 (0) 

0.28 (0) 

0.21 (0) 

2.25 (0) 

0.43 (0) 

0.23 

(0) 

8(10) 

8.76 (0) 

2.73 (0) 

0.38 (0) 

14.73 (0) 

4.54 (0) 

0.44 

(0) 

11(10) 

108.70 (1) 

83.26 (1) 

0.98 (0) 

171.01 (2) 

104.84 (1) 

1.28 

(0) 

14(10) 

180.99 (2) 

125.83 (1) 

2.08 (0) 

299.22 (4) 

245.62 (3) 

2.67 

(0) 

17(10) 

418.92 (6) 

364.95 (5) 

5.15(0) 

549.01 (9) 

513.21 (8) 

8.14 

(0) 

20(10) 

485.35 (8) 

453.39 (7) 

7.32 (0) 

507.66 (8) 

501.74 (8) 

14.45 

(0) 

23 (10) 

542.03 (9) 

508.75 (8) 

13.91 (0) 

600.00(10) 

600.00(10) 

23.16 

(0) 

26(10) 

600.00(10) 

600.00 (10) 

33.20 (0) 

600.00 (10) 

600.00(10) 

154.51 

(2) 

29 (10) 

600.00(10) 

600.00 (10) 

60.78 (0) 

600.00(10) 

600.00 (10) 

108.03 

(0) 

32(10) 

600.00(10) 

600.00 (10) 

129.95 (0) 

600.00 (10) 

600.00(10) 

315.56 

(4) 

35 (10) 

600.00(10) 

600.00 (10) 

136.84 (0) 

600.00 (10) 

600.00(10) 

302.90 

(3) 

38(10) 

600.00(10) 

600.00 (10) 

308.92 (3) 

600.00(10) 

600.00(10) 

441.06 

(6) 

41 (10) 

600.00(10) 

600.00 (10) 

421.69 (6) 

600.00 (10) 

600.00(10) 

529.80 

(8) 

44(10) 

600.00(10) 

600.00 (10) 

470.61 (7) 

600.00(10) 

600.00(10) 

553.19 

(9) 

47 (10) 

600.00(10) 

600.00 (10) 

485.60 (7) 

600.00 (10) 

600.00(10) 

529.00 

(8) 

50(10) 

600.00 (10) 

600.00 (10) 

485.07 (7) 

600.00 (10) 

600.00(10) 

526.66 

(8) 


Table 3: RSPaper experiments: variable number of papers/reviewers, multiple answer sets, 
randomized. 


evaluation without a decomposition framework performs slightly better than the old method. 
Different from RSTrack, we can see a clear difference between finding the first answer 
set and enumerating all answer sets as RSPaper instances have more than one answer set. 

To confirm that the new method is geared towards handling many external atoms, we 
conducted also experiments with instances that had few external atoms for eliminating 
answer set candidates but many local constraints. For such highly constrained instances, the 
new decomposition framework is not beneficial as it incurs an overhead compared to the 
monolithic evaluation that increases runtimes. 


6.3.3 Summary 


The results demonstrate a clear improvement using the new framework; they can often be 
further improved by fine-tuning the grounding and solving algorithm, and by customizing 
the default heuristics of the framework, as discussed by Eiter et al. {2014 b} 2014a| , and 
Redl (2014) . However, already the default settings yield results that are significantly better 
than using the previous evaluation method or using no framework at all; note that the latter 
requires an advanced grounding algorithm as by |Eiter et al. (2014a] >, which was not available 
at the time the initial evaluation approach as by |Schindlauer (2006| > was developed. 

In conclusion, the evaluation framework in Section [5] pushes HEX-programs towards 
scalability for realistic instance sizes, which previous evaluation techniques missed. 


7 Related Work and Discussion 

We now discuss our results in the context of related work, and will address possible 
optimizations. 
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7.1 Related Work 

We first discuss related approaches for integrating external reasoning into ASP formalisms, 
then we discuss work related to the notion of rule dependencies that we introduced in 


Section 4.1 we discuss related notions of modularity and program decomposition. Finally 


we relate our splitting theorems to other splitting theorems in the literature. 


7.1.1 External Sources 


The dlv-ex system (Calimeri et al. 2007) was a pioneering work on value invention through 


external atoms in ASP. It supported Vi-restricted programs, which amount to HEX-programs 
under extensional semantics without higher-order atoms and a strong safety condition that 
is subsumed by Ide-safety. Answer set computation followed the traditional approach on top 
of dlv, but used a special progressive grounding method (thus an experimental comparison 
to solving, i.e., model building as in the focus of this paper, is inappropriate). 

With respect to constraint theories and ASP, several works exist. The ACsolver system 


(Mellarkod et al. 2008 1 , the Ezcsp system (Balduccini 20091, and the clingcon system (Os 


trowski and Schaub 2012) divide the program into ASP-literals and constraint-literals, which 
can be seen as a special case of HEX-programs that focuses on a particular external source. 
As for evaluation, an important difference to general external sources is that constraint atoms 
do not use value invention. The modularity techniques from above are less relevant for this 
setting as grounding the overall program in one shot is possible. However, this also fits 
into our framework as disabling decomposition in fact corresponds to a dedicated (trivial) 
heuristics which keeps the whole program as a single unit. For a detailed comparison 
between ACsolver, Ezcsp, and clingcon see (|Lierler 2014ji. 


Balduccini and Lierler (2013) also experimentally compared Ezcsp and clingcon while 


varying the degree of integration between the constraint solver and the ASP solver backend. 
Their ‘black-box integration’ corresponds with dlvhex’s integration of the dlv backend: 
external atom semantics are verified by plugins callbacks only when a full answer set 
candidates has been found in the backend; moreover their ‘clear-box integration’ corresponds 
with dlvhex’s integration of the clasp solver backend: plugin callbacks are part of the CDCL 
propagation and can operate on partial answer set candidates. Note that constraint answer 
set programs can be realized as a dlvhex-plugin (such an effort is currently ongoing)^] 

We also remark that gringo and clasp use a concept called ‘external atoms’ for realizing 
various applications such as constraint ASP solving as in clingcon and incremental solv¬ 
ing ( Gebser et al, 2014) . However, despite their name they are different from external atoms 
in HEX-programs. In the former case, external atoms are excluded from grounding-time 
optimization such that these atoms are not eliminated even if their truth value is determinis¬ 
tically false during grounding. This allows to add rules that found truth of such atoms in 
later incremental grounding steps. In case of HEX the truth value is determined by external 
sources. Moreover gringo contains an interface for Lua and Python functions that can per¬ 
form computations during grounding. HEX external atoms are more expressive: they cannot 
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always be evaluated during grounding because their semantics is defined with respect to the 
answer set. 


7.1.2 Rule Dependencies 

In the context of answer set programming, dependency graphs over rules have been used 
earlier, e.g., by |Linke (200 1| > and |Linke~an d Sarsakov (2004). However, these works consider 
only ordinary ground programs, and furthermore the graphs are used for characterizing 
and computing the answer sets of a program from these graphs. In contrast, we consider 
nonground programs with and external atoms, and we use the graph to split the program 
into evaluation units with the goal of modularly computing answer sets. 


7.1.3 Modularity 


Our work is naturally related to work on program modularity under stable model semantics, 


as targeted by splitting sets (Lifschitz and Turner 1994 1 and descendants, with the work 


by jOikarinen and Janhunen (2008 1 and Janhunen et al. (2009) 1 a prominent representative 
that lifted them to modular programs with choice rules and disjunctive rules, by considering 


‘symmetric splitting’. Other works, e.g., by Lierler and Truszczynski (2013) go further 


to define semantics of systems of program modules, departing from a mere semantics¬ 
preserving decomposition of a larger program into smaller parts, or consider multi-language 
systems that combine modules in possibly different formalisms on equal terms (cf. e.g. 
|Jarvisalo et al. (20091 and |Tasharrofi and Ternovska (2011| ). 

Comparing the works by Oikarinen and Janhunen (20081 and Janhunen et al, (2009| as, 
from a semantic decomposition perspective, the closest in this group to ours, an important 
difference is that our approach works for non-ground programs and explicitly considers 
possible overlaps of modules. It is tailored to efficient evaluation of arbitrary programs, 
rather than to facilitate module-style logic programming with declarative specifications, 
or to provide compositional semantics for modules beyond uni-lateral evaluation, as done 
by [Jarvisalo et al. (20091 and |Tasharrofi and Ternovska (2011 >; for them, introducing values 
outside the module domain (known as value invention ) does not play a visible role. In 


this regard, it is in line with previous HEX-program evaluation (Eiter et al. 2006 1 and 
decomposition techniques to ground ordinary programs efficiently (|Calimeri et al. 2008). 


7.1.4 Splitting Theorems 

Our new splitting theorems compare to related splitting theorems as follows. 

Theorem[I]is similar to Theorem 4.6.2 by |Schindlauer (2006j l; however, we do not use 
splitting sets on atoms, but splitting sets on rules. Furthermore, |Schindlauer (2006j ) has no 
analog to Theorem [2] 

The seminal Splitting Set Theorem by [Lifschitz and Turner (1994| divides the interpreta¬ 
tion of P into disjoint sets X and Y, where X is an answer set of the ‘bottom’ gb A {P ) C P 
and Y is an answer set of a ‘residual’ program obtained from P \ gb A (P ) and X. In the 
residual program, all references to atoms in X are removed, in a way that it semantically 
behaves as if facts X were added to P\ gh A (/’), while the answer sets of the residual do 
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not contain any atom in X. This works nicely for answer set programs, but it is problematic 
when applied to HEX programs, because external atoms may depend on the bottom and 
on atoms in heads of the residual program; hence, they cannot be eliminated from rale 
bodies. The only way to eliminate bottom facts from the residual program would be to 
‘split’ external atoms semantically into a part depending on the bottom and the program 
remainder, and by replacing external atoms in rules with external atoms that have been 
partially evaluated wrt. a bottom answer set. Technically, this requires to introduce new 
external atoms, and formulating a splitting theorem for HEX programs with two disjoint 
interpretations X and Y is not straightforward. Furthermore, such external atom splitting 
and partial evaluation might not be possible in a concrete application scenario. 

Different from the two splitting theorems recalled above, the Global Splitting Theorem 
by Eiter et al. (2006} > does not split an interpretation of the program P into disjoint inter¬ 
pretations X and Y, and thus should be compared to our Theorem[2] However, the Global 
Splitting Theorem does not allow constraint sharing, and it involves a residual program 
which specifies how external atoms are evaluated via ‘replacement atoms’, which lead to 
extra facts D in the residual program that must be removed from its answer sets. Both the 
specification of replacement atoms and the extra facts make the Global Splitting Theorem 
cumbersome to work with when proving correctness of HEX encodings. Moreover, the 
replacement atoms are geared towards a certain implementation technique which however 
is not mandatory and can be avoided. 

Lemma 5.1 by Eiter et al. (1997] ) is structurally similar to our Theorem [2] answer sets 
of the bottom program are evaluated together with the program depending on the bottom 
(here called the residual), hence answer sets of the residual are answer sets of the original 
program. However, the result was based on atom dependencies and did neither consider 
negation nor external atoms. 

In sumary our new Generalized Splitting Theorem has the following advantages. 


• By moving from atom to rule splitting sets, no separate definition of the bottom is 
needed, which just becomes the (rule) splitting set. 

• As regards HEX-programs, splitting is simple (and not troubled) if all atoms that are 
true in an answer set of the bottom also appear in the residual program. Typically, this 
is not the case in results from the literature. 

• Finally, also the residual program itself is simpler (and easier to construct), by just 
dropping rules and adding facts. No rule rewriting needs to be done, and no extra 
facts need to be introduced in the residual program nor in the bottom. 


The only (negligible) disadvantage of the new theorems is that the answer sets of the bottom 
and the residual program may no longer be disjoint; however, each residual answer set 
includes some (unique) bottom answer set. 


7.2 Possible Optimizations 

Evaluation graphs naturally encode parallel evaluation plans. We have not yet investigated 
the potential benefits of this feature in practice, but this property allows us to do parallel 
solving based on solver software that does not have parallel computing capabilities itself 
(‘parallelize from outside’). This applies both to programs with external atoms, as well as to 
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ordinary ASP programs (i.w., without external atoms). Improving reasoning performance 
by decomposition has been investigated by Amir and Mcllraith (2005) ), however, only wrt. 
monotonic logics. 

Improving HEX evaluation efficiency by using knowledge about domain restrictions of 
external atoms has been discussed by |Eiter et al. (20091 ). These rewriting methods yield 
partially grounded sets of rules which can easily be distributed into distinct evaluation units 
by an optimizer. This directly provides efficiency gains as described in the above work. 

As a last remark on possible optimizations, we observe that the data flow between 
evaluation units can be optimized using proper notions of model projection, such as in 
( jGebser et al. 2009a| l. Model projections would tailor input data of evaluation units to 
necessary parts of intermediate answer sets; however, given that different units might need 
different parts of the same intermediate input answer set, a space-saving efficient projection 
technique is not straightforward. 


8 Conclusion 


HEX-programs extend answer set programs with access to external sources through an 
API-style interface, which has been fruitfully deployed to various applications. Providing 
efficient evaluation methods for such programs is a challenging but important endeavor, in 
order to enhance the practicality of the approach and to make it eligible for a broader range 
of applications. In this direction, we have presented in this article a novel evaluation method 
for HEX-programs based on modular decomposition. We have presented new results for the 
latter using special splitting sets, which are more general than previous results and use rule 
sets as a basis for splitting rather than sets of atoms as in previous approaches. Furthermore, 
we have presented an evaluation framework which employs besides a traditional evaluation 
graph that consists of program components and reflects syntactic dependencies among 
them, also a model graph whose nodes collect answer sets that are combined and passed on 
between components. Using decomposition techniques, evaluation units can be dynamically 
formed and evaluated in the framework using different heuristics. Moreover, the answer sets 
of the overall program can be produced in a streaming fashion. The new approach leads in 
combination with other techniques to significant improvements for a variety of applications, 
as demonstrated by Eiter et al. ( |2014a| 2014b| > and |Redl (2014[ ). Notably, while our results 
target HEX-programs, the underlying concepts and techniques are not limited to them (e.g., 
to separate the evaluation and the model graph) and may be fruitfully transferred to other 
rule-based formalisms. 


8.1 Outlook 

The work we presented can be continued in different directions. As for the prototype 
reasoner, a rather straightforward extension is to support brave and cautious reasoning 
on top of HEX programs, while incorporating constructs like aggregates or preference 
constraints requires more care and efforts. Regarding program evaluation, our general 
evaluation framework provides a basis for further optimizations and evaluation strategies. 
Indeed, the generic notions of evaluation unit, evaluation graph and model graph allow to 
specialize and improve our framework in different respects. First, evaluation units (which 
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may contain duplicated constraints), can be chosen according to a proper estimate of the 
number of answer sets (the fewer, the better); second, evaluation plans can be chosen by 
ad-hoc optimization modules, which may give preference to (a combination of) time, space, 
or parallelization conditions. Third, the framework is amenable to a form of coarse-grained 
distributed computation at the level of evaluation units (in the style of Perri et al. (2010) >). 

While modular evaluation is advantageous in many applications, it can also be counter¬ 
productive, as currently the propagation of knowledge learned by conflict-driven techniques 
into different evaluation units is not possible. In such cases, evaluating the program as a 
single evaluation unit is often also infeasible due to the properties of the grounding algo¬ 
rithm, as observed by Eiter et al. (2014a I. Thus, another starting point for future work is a 
tighter integration of the solver instances used to evaluate different units, e.g., by exchanging 
learned knowledge. In this context, also the interplay of the grounder and the solver is an 
important topic. 
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Appendix A Proofs 

Proof of Theorem [7] (, Splitting Theorem) 

Given a set of ground atoms M and a set of rules R, we denote by M\ R = M n gh{R ) the 
projection of M to ground heads of rules in R. 

(=>) Let M G JS(P). We show that (1) M\ R e AS(R) and that (2) M e AS(P \R U 
facts 

As for (1), we first show that M\ R satisfies the reduct fR M ' R , and then that M\ R is indeed 
a minimal model of fR M l R . M satisfies fP M and R C P. Observe that, by definition of 
FLP reduct, fR AI C fP AI . By definition of rule splitting set, satisfiability of rules in R does 
not depend on heads of rules in P\R (due to the restriction of external atoms to extensional 
semantics, this is in particular true for external atoms in R). Therefore fR M ' R = fR M , 
M satisfies fR M l H , and M\ R satisfies fR M \ R . For showing M\r G AS(R), it remains to 
show that M\ r is a minimal model of fR M ' R . 

Assume towards a contradiction that some S C M\ R is a model of fR M ' R . Then there 
is a nonempty set A = M \r\S of atoms with A C gh{R). Let M* = M \ A. We 
next show that M* is a model of fP M , which implies that M (f AS(P). Assume on the 
contrary that M * is not a model of fP M . Hence there exists some rule r G fP M such that 
H(r) D M* = 0, B + {r) C M*, B~(r) D M* = 0 and external atoms in B + {r) (resp.. 
B~ (r)) evaluate to true (resp., false) wrt. M*. S agrees with M* on atoms from gh(R), and 
S satisfies fR M \ R . The truth values of external atoms in bodies of rules in R depends only 
on atoms from gh(R), therefore external atoms in R evaluate to the same truth value wrt. S 
and M*. Therefore r ^ fR M \ R and r G f(P\ R) M . Since r G P\R, H{r ) C gh(P \ R), 
and because M and M* agree on atoms from gh{P \ R), H(r ) H M* = 0 from above 
implies that H (■ r)C\M = 0. Because r G fP M , its body is satisfied in M, and since its head 
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has no intersection with M, we get that fP M is not satisfied by M, which is a contradiction. 
Therefore M* is a model of fP M . As M*cM, this contradicts our assumption that 
M £ AS(P). Therefore S = M\r = X is a minimal model of fR M . 

We next show that M satisfies the reduct /(P \ R U facts(M\ r)) m , and then that 
it is indeed a minimal model of the reduct. By the definition of reduct, f(P \ R U 
facts(M\n)) M = f (P\R) m U facts(M\ r). M satisfies facts (M\r) because M\r C M. 
Furthermore f(P \ R) M C fP M , hence M satisfies f(P \ R) M . Therefore M satisfies 
f(P\RUfacts(M\ B )) M . 

To show that M is a minimal model of f{P \ R U facts (M \ r)) m , assume towards a 
contradiction that some S C M is a model of f(P \RU facts (M\r)) m . Since facts (M\r) 
is part of the reduct, M\r C S, therefore S\ rik (yy = M\r. By definition of rule splitting 
set, satisfiability of rules in R does not depend on heads of rules in P\R, hence S satisfies 
fR M . Because S satisfies f(P \ RU facts {M\r)) m = f(P \ R) M U facts (M\ B ), it also 
satisfies /(P \ R) M . Since S satisfies both fR M , S satisfies fP M = f(P \ R) M u fR M . 
This is a contradiction to M £ _4S(P). Therefore S = M is a minimal model of f(P \ R U 
facts (M\r)) m . 

(<^=) Let M £ AS(P\RUfacts(X)) and let X £ AS(R). We first show that M satisfies 
fP AI , and then that it is a minimal model of fP M . 

As facts X are part of the program P \ PU facts (X), and by definition of rule splitting set, 
P\R contains no rule heads unifying with gh{R), hence we have X = M\ B . Furthermore 
/(P \ J?U facts{X)) M \ facts(X) U fR M = fP M , and as M satisfies the left side, it 
satisfies the right side. To show that M is a minimal model of fP M , assume S C M is 
a smaller model of fP M . By definition of reduct, S also satisfies f(P\ R) M and fR M . 
Since R is a splitting set, satisfiability of rules in R does not depend on heads of rules in 
P\R, therefore fR M = fR M \ R = fR x and S\ g h(R) satisfies fR x . Since S C M, we 
have S\ g h(R) C X. Because X is a minimal model of fR x , C X is impossible 

and = X. Therefore S\ gh (p\ R ) C M\ gh ( P \ R y Because S satisfies f(P \ R) M 

and S\ g h(R) = X, S also satisfies /(P \ flU facts{X)) M . Since S C M, this contradicts 
the fact that M is a minimal model of P \ R U facts (X). Therefore S = M is a minimal 
model of fP M . □ 


Proof of Theorem^) Generalized Splitting Theorem) 

By definition of generalized bottom, the set C = B\R contains only constraints, therefore 
gh{B) = gh(R) and M\ g hm^ = M\ gh (Ry As R C B and B\R contains only constraints, 
AS(B) C AS(R). The only difference between Theorem [I] and Theorem [2] is, that for 
obtaining X, the latter takes additional constraints into account. 

(=>) It is sufficient to show that M \ g h(B) docs not satisfy the body of any constraint in 
C C P if M does not satisfy the body of any constraint in P. Since B is a generalized 
bottom, no negative dependencies of constraints C to rules in P\B exist; therefore if the 
body of a constraint c £ C is not satisfied by M, the body of c is not satisfied by M\ gh ( B y 
As M satisfies P, it does not satisfy any constraint body in P, hence the projection M \ g h(B) 
does not satisfy any constraint body in B\R. 

(y=) It is sufficient to show that an answer set of R that satisfies a constraint body in 
C also satisfies that constraint body in P, which raises a contradiction. As constraints in 
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C have no negative dependencies to rules in P \ B, a constraint with a satisfied body in 
M\ g h(R) also has a satisfied body in M, therefore the result follows. □ 

Proof of Proposition [7] 

Assume towards a contradiction that there exist a non-constraint r £ P, a rule s £ P with 
r —¥ m ,n s, and u! £ U\ r , v' £ U\ s such that (if, v') ^ E. Due to Definition^ r -^ m ,n s 
implies that s has H(s) f 0 and therefore that s is a non-constraint. Definition 14]([b|i then 
implies that J 7 | r = {it'} and U\ s = {i/} (non-constraints are present in exactly one unit). 

Case (i): for r —> n s. Definition [l4](|c]> specifies that for all u £ U\ r and v £ U\ 8 there 
exists an edge (u, v ) £ E, therefore also (u', v') £ E, which is a contradiction. 

Case (ii): for r — s, Definition[l4](|d]) specifies that some u £ U\ r exists such that for 
every v £ U\ s there exists an edge (u, v) £ E\ since U\ r = {it'} and U\ s = {i/}, it must 
hold that (if, v') £ E, which is a contradiction. □ 

Proof of Proposition [2] 

Given two distinct units U\,U 2 £ U, assume towards a contradiction that some 7 £ 
gh(u\) D < 7 / 1 ( 112 ) exists. Then there exists some r £ u± with a £ H(r) and a ~ 7, and 
there exists some s £ U 2 with /3 £ H(s) and j3 ~ 7 . As a ~ 7 and /3 ~ 7 and 7 is ground, 
we obtain a ~ f3\ hence, by Definition [9] ([mil we have r —> m s and s —> m r. As r and s 
have nonempty heads, they are non-constraints. Thus by Proposition [I] there exist edges 
(111,112), (112,111) £ E. As an evaluation graph is acyclic, it follows u\ = u-z, this is a 
contradiction. □ 


Proof of Proposition [j] 

For an Ide-safe program P, the graph £ = ({P}, 0) is a valid evaluation graph. □ 


Proof of Theorem^ 

For any set of rules, let constr(S) = {r £ S \ H(r) = 0} denote the set of constraints in 
S. We say that the dependencies of r £ Q are covered at unit u £ U, if for every rule s £ Q 
such that r —t m ,n s and s £ u, it holds that (u, if) £ E for all v! £ U | s , i.e., u has an edge 
to all units containing s. 

To prove that B = u < is a generalized bottom of P = u- wrt. the rule splitting set 
R = it < \ constr(u < ) as by Definition 12 we prove that (a) R C B C P, (b) B \ R 


contains only constraints, (c) no constraint in B\R has nonmonotonic dependencies to 
rules in P\B, and (d) R is a rule splitting set of P. 

Statement (a) corresponds to it < \ constr(u < ) C u < C u- and u— is defined as 
it- = i^ U it, therefore the relations all hold. For (b), B\R = u K \ (u < \ constr(u < )), 
and as A \ (A \ B) = A (T B, it is easy to see that B \ R = u K C I constr(u < ) and thus 
B \ R only contains constraints. For (c), we show a stronger property, namely that no 
rule (constraint or non-constraint) in B has nonmonotonic dependencies to rules in P\B. 
B = u K is the union of evaluation units V = {v £ U \ v < u}. By Definition 14 ([cj all 
nonmonotonic dependencies r — s are covered at every unit w such that w £ U r . Hence 
if r £ w and w £ V, then either s £ w or s £ tzP holds, and hence s £ w- C it < . As 
P\B = u- \ w < , no nonmonotonic dependencies from B = u K to P\B exist and (c) holds. 
For (d) we know that R = u < \ constr(u < ) contains no constraints, and by Proposition [T| 





48 


Eiter, Fink, Ianni, Krennwallner, Redl, and Schuller 


all dependencies of non-constraints in R are covered by £. Therefore r £ R, r -£ rn ,n s, 
and s £ P implies that s £ R. Consequently, fd) holds which proves the theorem. □ 

Proof of Theorem [5] 

Similar to the proof of Theorem [ 3 ] we show this in four steps; given P = u < , R = 
u'- \ constr{u'~), and B = u'- = u! U u' < , we show that (a) R C B C P, (b) B \ R 
contains only constraints, (c) no constraint in B \ R has nonmonotonic dependencies to 
rules in P \ B, and (d) R is a rule splitting set of P. Let preds £ [u) = {iti,..., Uk) and 
Let V = {v £ U \ v < u'} be the set of units on which v! transitively depends. (Note that 
V C preds £ (u) and u (j V.) As u f< contains all units v! transitively depends on, we have 
B = v! U \Jwzv w. 

For (a), R C B holds trivially, and B C P holds by definition of if and u'- and 
because u' £ preds £ (u). Statement (b) holds, because B \ R removes R from If i.e., it 
removes everything that is not a constraint in B from If therefore only constraints remain. 
For (c) we show that no rule in B has a nonmonotonic dependency to rules in P \ If By 
Definition[l4](|c]i, all nonmonotonic dependencies are covered at all units. Therefore a rule 
r £ w, w £ {u 1 } U V with r —> n s, s £ U implies that either s £ w, or that s is contained 
in a predecessor unit of w and therefore in v! or in V. Flence there are no nonmonotonic 
dependencies from rules in B to any rules not in B, and hence also not to rules in P\B 
and (c) holds. For (d) we know that R contains no constraints and by Proposition [T] all 
dependencies of non-constraints in R are covered by £. Therefore r £ I f r —> m ,n s, s £ P 
implies that s £ R and the theorem holds. □ 

Proof of Proposition [7] 

(=>) The added vertex m! is assigned to one unit and gets assigned a type. Furthermore, the 
graph stays acyclic as only outgoing edges from m' are added. I-connectedness is satisfied, 
as it is satisfied in T and we add no o-interpretation. O-connectedness is satisfied, as m! gets 
appropriate edges to o-interpretations at its predecessor units, and for other i-interpretations 
it is already satisfied in I. 

For FAI intersection, observe that if we add an edge ( m',rrii) to I and it holds that 
rrii £ o-intsx(ui), then m' reaches in T only one o-interpretation at Ui, and due to O- 
connectedness that o-interpretation is connected to exactly one i-interpretation at tq, which 
is part of the original graph T and therefore satisfies FAI intersection. Therefore it remains 
to show that the union of subgraphs of T reachable in I from mi,... ,rrik, contains one 
o-interpretation at each unit in the subgraph of E reachable from u±,... ,Uk■ We make a case 
distinction. 

Case (I): two o-interpretations m* £ o-intsifuf), rrij £ o-intsi[uj ) in the join, with 
1 < * < j < k, have no common unit that is reachable in £ from Ui and from Uj : then 
the condition is trivially satisfied, as the subgraphs of I reachable in I from m, and rn :) , 
respectively, do not intersect at any unit. 

Case (II): two o-interpretations rrii £ o-intsx(ui ), nij £ o-intsx{uj) in the join, with 
1 < 1 < j < k, have at least one common unit that is reachable from w, and from Uj in £. 
Let if be a unit reachable in £ from both Ui and Uj on two paths that do not intersect before 
reaching uJ. From Ui to vf, and from u 3 to vf, exactly one o-interpretation is reachable in 
T from rrii an d rrij, respectively, as these paths do not intersect, is a FAI of u, and as the 
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join is defined, we reach in £ exactly one o-interpretation at unit u' from in, and rrij. Due 
to O-connectedness, we also reach in I exactly one i-interpretation m" at u* from m, and 
?t ij. Now m" is common to subgraphs of X that are reachable in X from in, and rrij, and 
m" satisfies FAI intersection in X. 

Consequently, FAI intersection is satisfied in X' for all pairs of predecessors of m' and 
therefore in all cases. As no vertex m with {( 777 ,, mi),..., (m, m &)} C F exists and and as 
X satisfies Uniqueness, also X' satisfies Uniqueness. 

(<=) Assume towards a contradiction that X' is an i-graph but that the join is not defined. 
Then there exists some FAI u' £ fai(u) such that either no or more than one o-interpretation 
from o-intsi{u ) is reachable in X from some mi, 1 < i < k. As I is an i-graph, due to 
I-connectedness and O-connectedness, if a unit u' is a FAI and therefore v! is reachable in 
£ from Ui, then at least one i-interpretation and one o-interpretation at v! is reachable in 
X from ?77j. If more than one o-interpretation is reachable in X from some to,;, 1 < i < /.:, 
this means that more than one o-interpretation at v! is reachable in X' from the newly added 
i-interpretation m. However, this violates FAI intersection in X', which is a contradiction. 
Hence the result follows. □ 

Proof of Proposition [5] 

(=>) Whenever the join is defined, A! is an i-graph by Proposition [4] It remains to show 
that int(m') + £ AS'(u < ), and that A! fulfills items (a) and (c) of an answer set graph. 
By Theorem W we know that for each u,. uf is a generalized bottom of it < wrt. the set 
Ri = {r £ up | B(r) f=- 0}. For each m, therefore Y £ AS(u < ) iff Y £ AS{u < \ Ri U 
facts (X)) for some A' £ AS (up). As A is an answer set graph, for each mi we know that 
int(mi) + £ AS(uf): hence Y £ AS(m < ) if Y £ AS(u < \Ri U int(mi) + ). Now from the 
evaluation graph properties we know that u < = up U • • • Uttr, and from the construction of 
int(m') and its dependencies in A' we obtain that int(m') + = mf(?77i) + U' • -Uint(mk) + ■ 
It follows that int(m') + £ AS(u < ), which satisfies condition 0- Due to the definition of 
join, condition 0 is also satisfied and A! is indeed an answer set graph. 

(<=) As A' is an answer set graph, it is an i-graph, and hence by Proposition [4] m = 
mi to • • • cxi 777fc is defined. □ 

Proof of Theorem [5] 

We prove this theorem using Proposition [6] We construct £" = (U", E") with U" = U U 
{ufinal}-Ufinal = 0, and E" = Eli { (ufinai , u) | u £ U]. As Ufi na i contains no rules and as 
£" is acyclic, no evaluation graph property of gets violated and £" is also an evaluation graph. 
As A contains no interpretations at Ufi na i and dependencies from units in U are the same in £ 
and £", A is in fact an answer set graph for £". We now modify A to obtain A" as follows. 
We add the set M new = {m \ m = mi cx] • • • ixi m n is defined at Ufi na i (wrt. A)} as i- 
interpretations of Ufi na i and dependencies from each m £ M new to the respective o-interpre- 
tations rn, , 1 < i < ri. By Proposition]^ A" is an answer set graph for £", and moreover A" 
gets input-complete for Ufi na i by construction. As A" is input-complete for U U {ufi na i} and 
output-complete for U, by Proposition |6] we have that AS(P) = i-ints^(ufi na i) = M new . 
As for every join m = mi cx • • • cxi m n , we have int(m) = int(mi)U ■ ■ ■ U int(m n ), to 
complete the proof of the theorem, it remains to show that the join m between mi,... ,m n 
is defined at Ufi na i iff the subgraph A of A reachable from the o-interpretations 777 , in 
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F fulfills \o-ints ^{uf)\ = 1, for each m £ U. As the join involves all units in U, and 
since A" is an answer set graph and thus an i-graph, it follows from the conditions for 
an i-graph that at each m £ U exactly one o-interpretation is reachable from m, and thus 
also from each m,;; thus the condition for A' holds. Conversely, if the subgraph A! fulfills 
| o-intsjn(ui)\ = 1 for each u, £ U, then clearly the FAI condition for the join m being 
defined is fulfilled. □ 


Proof of Proposition [6] 

As Ufi n ai depends on all units in U \ {ufi na i}, due to O-connectedness every i-interpretation 
to £ i-intsA(ufi na i) depends on one o-interpretation at every unit in U \ {ufi na i}. Let 
U \ {ufi na i} = {tii,. .., Uk} and let Mm = {toi, ..., rrik} be the set of o-interpretations 
such that (to, mi) £ F and mi £ o-ints^{ui), 1 < i < k. Then, due to FAI intersection, 
M m contains each o-interpretation that is reachable from to in A, and M m contains only 
interpretations with this property. Hence int{in) + = int(mi) U • • • U int(mk), and due to 


condition Q in Definition 19 we have int{m) = int(m) + . By the dependencies of Ufi na i, 
we have u^ nal = P, and as Ufi na i is input-complete, we have that AS(P) = AS(u^ nal ) = 
{int(m) + | to £ i-ints a{u fi na f)}. As int(m) = int[rn) + for every i-interpretation to at 
Ufinai, we obtain the result. □ 

Proof of Proposition [ 7 ] 

The proposition follows from Property [I] which asserts that the grounding P' has the same 
answer sets as P, and from the soundness and completeness of the evaluation algorithm for 
ground HEX-programs as asserted by Property [2] □ 

Proof of Theorem^ 

We show by induction on its construction that I = (M, F , unit, type , int) is an answer set 
graph for £, and that at the beginning of the while-loop I is input- and output-complete for 

V\U. 

(Base) Initially, I is initially and V = U, hence the base case trivially holds. 

(Step) Suppose that X is an answer set graph for £ at the beginning of the while-loop, 
and that it is input- and output-complete for V \ U. As the chosen u only depends on 
units in V \ U, it depends only on output-complete units. For a leaf unit u, (b) creates an 


empty i-interpretation and therefore makes u input-complete. For a non-leaf unit u, the first 
for-loop [(cjl builds all possible joins of interpretations at predecessors of u and adds them 
as i-interpretations to X. As all predecessors of u are output-complete by the hypothesis, 


this makes u input-complete. Now suppose that Condition (d) is false, i.e., u ^ Ufi na i. Then 
the second for-loop [(ej| evaluates u wrt. every i-interpretation at u and adds the result to 
u as an o-interpretation. Due to Proposition [7] EvaluateLDESafe(u, int(m')) returns 
all interpretations o such that o £ {X \ int(m') \ X £ AS(u U facts (int (to'))}. As u 
depends on all units on which its rules depend, and as i-interpretations contain all atoms 
from o-interpretations of predecessor units (due to condition (JcJ of Definition p~9|), we have 
EvaluateLDESafe(u, int(m')) = EvaluateLDESafe(m, int(m') + ). By Theorem[3] 
u K is a generalized bottom of u-, and by the induction hypothesis int(in') + £ AS(w < ); 
hence by Theorem[2] we have that int{m') + U o £ AS(u-). Consequently, adding a new 
o-interpretation to with interpretation int.(m) = o and dependency to m! to the graph X 
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results in int(m) + £ AS(u-), and adding all of them makes I output-complete for u. 
Finally, in[(f)]u is removed from U\ hence at the end of the while-loop X is an answer set 
graph and again input- and output-complete for V\U. 

It remains to consider the case where Condition (d) is true. Then Ufi na i was made 
input-complete, which means that all predecessors of Ufi na i are output-complete. As Ufi na i 
depends on all other units, we have U = {ufi na i} and the algorithm returns i-ints a{ u )\ by 
Proposition]^] it thus returns AS(P), which will happen in the | Vj-th iteration of the while 
loop. □ 


Appendix B Example Run of Algorithm [2] 

We provide here an example run of Algorithm [ 2 ] for our running example. 


Example 28 ( ctd .) 

Consider an evaluation graph £) which is £2 plus Ufi na i = 0 , which depends on all other 
units. Following Algorithm [ 2 ] we first choose u = « i, and as u-\ has no predecessor 
units, step |(b)| creates the i-interpretation mi with mf(mi) = 0 . As ui Ufi na i, we 


continue and in loop (e) obtain O = AS(iti) = {{swim(ind)}, {swim(outd)}}. We 
add both answer sets as o-interpretations m2 and m3 and then finish the outer loop with 
U = {u2, M3, U4, Ufi na i}. In the next iteration, we could choose u = U2 or u = M3; assume 
we choose M2. Then preds £ {v,2) = {mi} and k = 1 , and we enter the loop |(c)| and build all 
joins that are possible with o-interpretations at ui (all joins are trivial and all are possible), 
i.e., we copy the interpretations and store them at w> as new i-interpretations TO4 and 
m3. In the loop |(e)| we obtain O = EvaluateLDESafe(m2, {swim(ind)}) = 0 , as 
indoor swimming requires money which is excluded by eg £ M2. Therefore i-interpreta- 
tion {swim(ind)} yields no o-interpretation, indicated by /. However, we obtain O = 
EvaluateLDESafe(m2, {swim(outd)}) = { 0 }: as outdoor swimming neither requires 
money nor anything else, i-interpretation {swim(outd)} derives no additional atoms and 
yields the empty answer set, which we store as o-interpretation me at M2; the iteration ends 
with U = {M3, M4, Ufi na i}. In the next iteration we choose u = M3, we add in loop |(c)| i-inter- 
pretations 7717 and ms to M3, and in loop |(e)| o-interpretations mg,..., mi2 to M3; the iteration 
ends with U = {114. Ufi na i } ■ In the next iteration we choose u = M4; this time we have 
multiple predecessors, and in loop |(c)| we check join candidates me tx mg and me 1x1 mio, 
which are both not defined. The other join candidates are me xi mu and me x m.12, which 
are both defined; we thus add their results as i-interpretations m.13 and m.14, respectively, to 
M4. The loop [(e)] computes then one o-interpretation mis f° r i-interpretation m.13 an d no 
o-interpretation for 77714. The iteration ends with U = {ufi na i}. In the next iteration, we 
have preds £ (ufi na i) = {mi, M2, M3, M4} and the loop (c) checks all combinations of one 


o-interpretation at each unit in preds £ (ufi na i )• Only one such join candidate is defined, 
namely m = m3 x me x mu x 77745, whose result is stored as a new i-interpretation 
at Ufinai- The check [(d)] now succeeds, and we return all i-interpretations at Ufi na i\ i.e., 
we return {m} = {{ swim(outd ), goto(altD), ngoto(gansD) 1 go, need{loc, yogamat)}}. 
This is indeed the set of answer sets of P SW im- 
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Algorithm 3: AnswerSetsOnDemand 
Input: evaluation graph £ for program P, with final unit Ufi na i = 0 
Output: the answer sets of P 

initialize global storage S 

repeat 

rriout '■= GetNextOutputModel (ufi na i) 
if m out UNDEF then output m out 
until m out = undef 


Appendix C On Demand Model Streaming Algorithm 

Algorithm [ 2 ] fully evaluates all other units before computing results at the final evaluation 
unit Ufi na i, and it keeps the intermediate results in memory. If we are only interested in one 
or a few answer sets, many unused results may be calculated. 

Using the same evaluation graph, we can compute the answer sets with a different, 
more involved algorithm AnswerSetsOnDemand (shown in Algorithm[3]i that operates 
demand-driven from units, starting with Ufi na i, rather than data-driven from completed units. 
It uses in turn several building blocks that are shown in Algorithms [4}j6] 

AnswerSetsOnDemand calls Algorithm GetNextOutputModel for ufi na i and 
outputs its output models, i.e., the answer sets of the input program P given by the evaluation 
graph £, one by one until it gets back undef. Like Algorithm [2] GetNextOutput¬ 
Model builds in combination with the other algorithms an answer set graph A for £ that is 
input-complete at all units, if all statements marked with ’(+)' are included; omitting them, 
it builds A virtually and has at any time at most one input and one output model of each 
unit in memory. 

Roughly speaking, the models at units are determined in the same order in which a right- 
to-left depth-first-traversal of the evaluation graph £ would backtrack from edges. This is 
because first all models of the subgraph reachable from a unit u are determined, then models 
at the unit u, and then the algorithm backtracks. The models of the subgraph are retrieved 
with GetNextInputModel one by one, and using NextAnswerSet the output models 
are generated and returned. The latter function is assumed to return, given a HEX-program 
P and the i-th element in an arbitrary but fixed enumeration / 1; / 2 ,..., I m of the answer 
sets of P (without duplicates), the next answer set / ;+1 , where by convention Iq = UNDEF 
and the return value for I m is UNDEF. This is easy to provide on top of current solvers, and 
the incremental usage of NextAnswerSet allows for an efficient stateful realization (e.g. 
answer set computation is suspended). 

The trickiest part of this approach is GetNextInputModel, which has to create locally 
and in an incremental fashion all joins that are globally defined, i.e., all combinations of 
incrementally available output models of predecessors which share a common predecessor 
model at all FAIs. To generate all combinations of output models in the right order, it uses 
the algorithm EnsureModelIncrement. 

The algorithms operate on a global data structure S = (£, A, curl, curO, refsO) called 
storage , where 

• £ = ( U , E) is the evaluation graph containing Ufi na i £ U, 
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Algorithm 4: GetNextOutputModel(u) 
Input: u: unit 

Output: m ou t' next omodel at u or UNDEF 


(+) 


if refsO(u ) > 0 then return UNDEF 

if cur\(u) = undef then curi(u) := GetNextInputModel(-u) 
while curl(u) 7 ^ UNDEF do 

curO(u) := NextAnswerSet(uU facts (cur l(u)), curO(u )) 
if curO(u) 7 ^ undef then 

add omodel curO(u ) to A with dependency to curl(u) 
return curO(u) 


cur\(u) := GetNextInputModel(u) 


return undef 


Algorithm 5: EnsureModelIncrement(u, at) 

Input: u: unit with {m ,..., Uk} = preds £ (u), at: index 1 < at < k 
Output: at': index at < at' < k or UNDEF 

repeat 

refsO(u at ) := refsO(u at ) - 1 

TO := GETNEXTOUTPUTMODEL(u at ) 

if to = UNDEF then at := at + 1 

else 

refsO(u at ) := refsO(u a t ) + 1 

return at 

until at = k + 1 
return undef 


• A = (M, F, unit , type, int) is the (virtually built) answer set graph, 

• curl :[/—>■ M U {undef} and curO : U —> M U {undef}, are functions 
that informally associate with a unit u the current input respectively output model 
considered, and 

• refsO : U —> N U { 0 } is a function that keeps track of how many current input models 
point to the current output model of u; this is used to ensure correct joins, by checking 
in GetNextOutputModel that the condition (IG-F) for sharing models in the 
interpretation graph is not violated (for details see Section | 5 . 1 . 2 | and Definition [ 17 }. 

Initially, the storage S is empty, i.e., it contains the input evaluation graph £, an empty 
answer set graph A, and the functions are set to curl(u) = UNDEF, curO(u) = UNDEF, 
and refsO(u ) = 0 for all u GU. The call of GetNextOutputModel for Ufi na i triggers 
the right-to-left depth-first traversal of the evaluation graph. 

We omit tracing Algorithm AnswerSetsOnDemand on our running example, as this 
would take quite some space; however, one can check that given the evaluation graph £ 2 , it 
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Algorithm 6: GetNextInputModel(w) 
Input: u: unit 

Output: m ou t'. imodel at u or UNDEF 


(a) if preds £ (u ) = 0 then 

if curi(u) = undef then 
(+) | add imodel 0 at u to A 

return 0 

else return undef 


let {u\,.. . ,Uk} = preds £ ( u ) /* assume this order is fixed for each unit u *1 

if curi(u) ^ undef then 

at := EnsureModelIncrement(/u, 1) 
if at = undef then return undef 


| at := at — 1 
else at := k 
(b) while at ^ 0 do 

if curO { u a t ) 7^ undef then 
refso ( u a t ) ■= refsO ( u at ) + 1 

at := at — 1 

else 

m := GETNEXTOUTPUTMODEL(u at ) 
if m = UNDEF then 

if at = k then return undef 
at := EnsureModelIncrement(m, at + 1) 
if at = undef then return undef 
else 

refsO { u at ) := refsO ( u at ) + 1 
at := at — 1 


let to = curO(ui) x ■ ■ • tx curO{uk) 

(+) add imodel to to A with dependencies to curO{u \),..., curO(uk) 

return m 


correctly outputs the single answer set 

/ = { swim(outd ), goto(altD), ngoto(gansD), go, need(loc, yogamat)}. 

Formally, it can be shown that given an evaluation graph £ = (U, E ) of a program P such 
that £ contains a final unit Ufi na i = 0 , Algorithm AnswerSetsOnDemand outputs one 
by one all answer sets of P, without duplicates, and that in the version without (+)-lines, it 
stores at most one input and one output model per unit (hence the size of the used storage is 
linear in the size of the ground program grnd(P)). 
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Appendix D Overview of Liberal Domain-Expansion Safety 

Strong domain-expansion safety is overly restrictive, as it also excludes programs that 
clearly are finitely restrictable. In this section we give an overview about the notion and 
refer to fEiter et al. 2014 a)> for details. 


Example 29 

Consider the following program: 

ri: p(a). r 3 : s(Y) 4— p(X), &concat[X , a](Y). 

r 2 : q(aa). r 4 : p(X) «- s(X), q{X). 

It is not strongly safe because Y in the cyclic external atom &concat[X. a](Y) in r3 does 
not occur in an ordinary body atom that does not depend on &concat[X , a](Y). However, 
P is finitely restrictable as the cycle is “broken” by dom(X) in r 4 . 



To overcome unnecessary restrictions of strong safety in ( jEiter et al. 2006j >, liberal domain- 
expansion safety (lde-safety) has been introduced ( jEiter et al. 2014a 1 , which incorporates 
both syntactic and semantic properties of a program. The details of the notion are not 
necessary for this paper, except that all lde-safe programs have finite groundings with the 
same answer sets; we give here a brief overview. 

Unlike strong safety, liberal de-safety is not a property of entire atoms but of attributes, 
i.e., pairs of predicates and argument positions. Intuitively, an attribute is lde-safe, if the 
number of different terms in an answer-set preserving grounding (i.e. a grounding which 
has the same answer sets if restricted to the positive atoms as the original program) is finite. 
A program is lde-safe, if all its attributes are lde-safe. 

The notion of lde-safety is designed in an extensible fashion, i.e., such that several safety 
criteria can be easily integrated. For this we parametrize our definition of lde-safety by a 
term bounding function (TBF), which identifies variables in a rule that are ensured to have 
only finitely many instantiations in the answer set preserving grounding. Finiteness of the 
overall grounding follows then from the properties of TBFs. 

For an ordinary predicate p£V, let p fi be the i-th attribute ofp for all 1 < i < ar{p). 
For an external predicate &g £ X with input list X in rule r, let &g[X] r \ T i with T £ {1, 0} 
be the i-th input resp. output attribute of &g [X] in r for all 1 < i < arp{&g). For a ground 
program P, the range of an attribute is, intuitively, the set of ground terms which occur in 
the position of the attribute. Formally, for an attribute p\i we have range{p\i, P) = {U \ 
p(ti, ..., £ ot .( p )) £ A(P)}; for an attribute &g[X] r \ T i we have range(&g[X] r \ T i, P) = 
{xf | &g [x 1 ](x°) e EA(P)}, where x s = xf ,..., x s ar . s{&g) . 

We use the following monotone operator to compute by fixpoint iteration a finite subset 
of grnd(P) for a program P: 

Gp(P') = U i r0 I 31 ^ A ( P ') A B + (r 9 )}, 

r£P 


where A(P') = {Ta, Fa | a £ A(P')} \ {Fa | a <— . £ P} and r6 is the ground instance 
of r under variable substitution 9 : V —> C. Note that in this definition, / might be partial, 
but by convention we assume that all atoms which are not explicitly assigned to true are 
false. That is, Gp takes a ground program P' as input and returns all rules from grnd(P) 
whose positive body is satisfied under some assignment over the atoms of IT. Intuitively, the 
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operator iteratively extends the grounding by new rules if they are possibly relevant for the 
evaluation, where relevance is in terms of satisfaction of the positive rule body under some 
assignment constructable over the atoms which are possibly derivable so far. Obviously, the 
least fixpoint Gp ( 0 ) of this operator is a subset of grnd(P)', we will show that it is finite 
if P is lde-safe according to our new notion. Moreover, we will show that this grounding 
preserves all answer sets as all omitted rule instances have unsatisfied bodies anyway. 

Example 30 

Consider the following program P: 

r i: s(a). r2- dom(ax). r 3 : dom(axx). 

r 4: s(Y) 4— s(X), &concat[X,x](Y), dom(Y). 

The least fixpoint of Gp is the following ground program: 

r[: s(a). r' 2 : dom(ax). r' 3 : dom(axx). 

r’ A : s(ax ) 4— s(a), &concat[a,x](ax), dom(ax). 

r' 5 : s(axx) 4— s(ax), &concat[ax,x](axx), dom(axx). 

Rule r'| is added in the first iteration and rule r'- in the second. 

Towards a definition of lde-safety, we say that a term in a rule is bounded , if the number 
of substitutions in Gp ( 0 ) for this term is finite. This is abstractly formalized using term 
bounding functions. 

Definition 21 (Term Bounding Function (TBF )) 

A term bounding function, denoted b(P, r, S, B), maps a program P, a rule r £ P, a set S 
of (already safe) attributes, and a set B of (already bounded) terms in r to an enlarged set 
of (bounded) terms b(P, r, S, B ) D B, such that every t £ b{P , r,S,B) has finitely many 
substitutions in Gp ( 0 ) if (i) the attributes S have a finite range in Gp ( 0 ) and (ii) each term 
in terms (r) (T B has finitely many substitutions in Gp ( 0 ). 

Intuitively, a TBF receives a set of already bounded terms and a set of attributes that are 
already known to be lde-safe. Taking the program into account, the TBF then identifies and 
returns further terms which are also bounded. 

The concept yields lde-safety of attributes and programs from the boundedness of vari¬ 
ables according to a TBF. We provide a mutually inductive definition that takes the empty 
set of lde-safe attributes Sq(P) as its basis. Then, each iteration step n > 1 defines first 
the set of bounded terms B n (r, P , b) for all rules r, and then an enlarged set of lde-safe 
attributes S n (P). The set of lde-safe attributes in step n- 1-1 thus depends on the TBF, which 
in turn depends on the domain-expansion safe attributes from step n. 

Definition 22 (Liberal Domain-Expansion Safety ) 

Let & be a term bounding function. The set B n (r, P, b) of bounded terms in a rule r € P in 
step n > 1 is B n {r , P, b) = (J ?>0 Bn,j (c P, b) where B n 3 (r, P,b) = 0 and for all j > 0 , 
B n j + 1 (r, P, b) = b(P , r, S n _i(P), B n j). 

The set of domain-expansion safe attributes S oa (P) = Uj>o Si{P) of a program P is 
iteratively constructed with So(P) = 0 and for n > 0 : 

• p\i£ S n +i(P) if for each r £ P and atomp(( 1 ,..., t ar ( p )) £ H(r), we have that term 
£j £ B n+ i(r, P, b), i.e., t t is bounded'. 
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• &0[X] J .|‘ I *eS , Tl+ i (P) if each X,; is a bounded variable, or X, is a predicate input 
parameter p and p \ 1 ,..., p \ ar (p) £ S n (P); 

• &5[X] r |' 0 *eS , n+ i (P) if and only if r contains an external atom &g[X](Y) such that 
Yj is bounded, or &g[X] r f,!,..., &g[X] r \ Y ari{&g) £ S n (P). 


A program P is liberally domain-expansion (Ide) safe, if it is safe and all its attributes 
are domain-expansion safe. 

A detailed description of liberal safety is beyond the scope of this paper. However, it is 
crucial that each liberally domain-expansion safe HEX-program P is finitely restrictable, i.e., 
there is a finite subset P g of grndc(P) s.t. AS(P g ) = AS(grnd c (P)). A concrete ground¬ 
ing algorithm GroundHEX is given in (Eiter et al. 2014a|; we use GROUNDHEX(P) in 
this article to refer to a finite grounding of P that has the same answer sets. 





